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(57) Abstract: Disclosed is a method for 
providing data useful for quantitatively analyzing 
metabolites, proteins and/or RNA in a biological 
source material, whereby said analysis involves 
suitable statistical evaluation and correlation 
analysis on the data obtained, preferably also 
including network analysis, said method being 
characterized by the step of extracting, identifying 
and quantifying at least two compound classes of 
the group consisting of ineiaholites, proteins and 
RNA from at least one sample from said biological 
source material, wherein the compounds of said 
at least two classes are each determined from one 
sample. Furthermore disclosed is a corresponding 
method for qualitatively analyzing metabolites, 
proteins and/or RNA in a biological source 
material com pn si i pr id d ita on metabolites, 
proteins and/or RNA, performing suitable 
statistical evaluation and correlation analysis 
on the data obtained; and optionally further 
performing a network analysis. Also disclosed 
is the use of a mixture of solvents suitable for 
extracting RNA, proteins and metabolites from 
a biological sample for extracting metabolites, 
and optionally also proteins and/or RNA from a 
biological sample in order to perform metabolite 
profiling. 
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COMBINED METABOLOMIC, PROTEOMIC AND TRANSCRIPTOMIC ANALYSIS FROM ONE, 
SINGLE SAMPLE AND SUITABLE STATISTICAL EVALUATION OF DATA 



The present invention relates to a method for providing data useful for quantitatively 
analyzing metabolites, proteins and/or RNA in a biological source material, whereby said 
analysis involves suitable statistical evaluation and correlation analysis on the data 
obtained, preferably also including network analysis, said method being characterized by 
the step of extracting, identifying and quantifying at least two compound classes of the 
group consisting of metabolites, proteins and RNA from at least one sample from said 
biological source material, wherein the compounds of said at least two classes are each 
determined from one sample. The invention furthermore relates to a corresponding 
method for quantitatively analyzing metabolites, proteins and/or RNA in a biological 
source material comprising providing data on metabolites, proteins and/or RNA, 
performing suitable statistical evaluation and correlation analysis on the data obtained; 
and optionally further performing a network analysis. Moreover, the invention relates to 
the use of a mixture of solvents suitable for extracting RNA, proteins and metabolites 
from a biological sample for extracting metabolites, and optionally also proteins and/or 
RNA from a biological sample in order to perform metabolite profiling. 

In the post-genomic era, a functional assignment of genes is getting more and important 
that goes beyond simple homology search and rough estimations of biochemical and 
biological roles. Instead, the scientific community tends towards a more comprehensive 
understanding of biology. Accordingly, the function of a gene is now regarded as 
depending on developmental and environmental changes as well as on the actual 
expression level of other genes rather than a simple, linear cause-event relationship. This 
change in paradigm will necessarily lead to the need to experimentally describe the state 
of biological tissues in depth on different levels, i.e. not only on the level of transcripts or 
protein expression, but also on the metabolite level and the interrelatedness of these 
levels. For example, Fiehn (Comp. Funct. Genom. 2 (2001), 155-168) considers a 
combination of results from in depth characterization of genetically altered organisms 
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using transcriptomics, proteomics and metabolomics in order to more comprehensively 
understand living organisms more feasible than was imagined before. Metabolites are the 
end products of cellular regulatory processes, and their levels can be regarded as the 
ultimate response of biological systems to genetic or environmental changes. In parallel 
to the terms "transcriptome" and "proteome", the set of metabolites synthesized by a 
biological system compose its "metabolome". 

In the growing fields of transcriptomics, proteomics and metabolomics, there is a rapidly 
ongoing progress in improving the techniques necessary for collecting data and for 
processing the data obtained. Current strategies and limitations for the quantitative 
analysis of cellular responses at all of the three levels mRNA, proteins, and metabolites 
have been recently summarized in a short review by Fiehn (Curr. Opin. Biotechnol. 12 
(2001), 82-86) including thoughts on database requirements and informatic tools. 
Today, transcriptomic approaches seem to give the best coverage of genome level 
responses. However, due to limitations in analytical precision and high costs, few 
transcriptomic studies adequately meet rigid statistical requirements. There are several 
prominent analytical approaches used for transcript profiling: microarray-based 
approaches (cDNAs or oligonucleotides) (e.g. reviewed in Granjeaud, Bioessays 21 
(1999), 781-790); sequencing-based approaches, such as serial analysis of gene 
expression (SAGE) (Velculescu, Science 270 (1995), 484-487) or massively parallel 
signature sequencing (MPSS) (Brenner, Nat. Biotechnol. 18 (2000), 630-634); and 
differential-dispiay-based approaches, such as arbitrarily primed (AP) PCR (Welsh, Nucl. 
Acids Res. 20 (1992), 4965-4970) and cDNA-amplified fragment length polymorphism 
(AFLP) (Bachem, Plant J. 9 (1996), 745-753). Of all of these, microarray technology has 
received the most widespread interest. This technique has been used extensively in 
yeast and human work, and thousands of expression profiles have been examined (e.g. 
Hughes, Cell 102 (2000), 109-126). 

For proteomic approaches, two-dimensional gel electrophoresis (2DE) is well established 
in many biological laboratories (Thiellement, Electrophoresis 20 (1999), 2013-2026) and 
is comparatively inexpensive. However, if the full set of proteins separated by 2DE gel is 
to be identified, highly automated systems are needed for cutting spots, digesting 
proteins, and analyzing peptides using mass spectrometry. Capillary isoelectric focusing 
is an alternative to 2DE that can be directly coupled to ion cyclotron resonance mass 
spectrometers (Fourier-transform mass spectrometry, FT-MS) for the analysis of both 
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crude protein mixtures (Jensen, Electrophoresis 21 (2000), 1372-1380) and complex 
peptide digests (Gao, J. Microcolumn 12 (2000), 383-390). Further advances have been 
made with approaches where liquid chromatographic (LC) separation has been 
combined with mass spectrometrical detection (MS). For instance, LC/LC/MS techniques 
allow more sensitive and rapid separations of complex peptide mixtures when cation 
exchange is coupled to reversed phase LC columns. This strategy has been shown to be 
superior in speed, robustness and protein coverage, when compared with 2DE analysis 
(Link, Nat. Biotechnol. 17 (1999), 676-682). According to a rather new technique, 
quantification of protein abundances can be performed using isotope coded affinity tags 
(ICAT) with precisions as accurate as 12% relative standard deviations (Gygi, Nat. 
Biotechnol. 17 (1999), 1112-1118). But to date, this technique has not been utilized for 
proteomic studies that go beyond one-to-one comparative experiments. 
Compared to transcriptomic and proteomic approaches, analytical techniques for 
metabolite detection and quantification are far more robust and mature. Analytical 
precisions may be below 1% relative standard deviations, and dynamic ranges may 
exceed four orders of magnitude. For instance in plant research, mass spectrometry has 
been used for decades to determine metabolic target compounds, but only recently has 
the idea been pursued of expanding the list of targets in order to profile a limited number 
of primary metabolites. Moreover, estimates of the total number of metabolites different 
members of the plant kingdom synthesise, that is estimated of plant metabolome sizes, 
range from 90,000-200,000. To determine comprehensive metabolomes, therefore, one 
must cope with the sheer complexity of mass spectra found in chromatograms and apply 
new technologies for efficient de novo compound identification. Most recently, high- 
througput profiling of metabolic snapshots has been demonstrated for the first time in the 
context of plant functional genomics (Fiehn, Nat. Biotechnol. 18 (2000), 1157-1161). 
Profiles of 326 metabolites of two Arabidopsis ecotypes were compared with two single 
gene mutants. Cluster analysis revealed distinct "metabolic phenotypes" for each of the 
four genotypes. 

On the other hand, evaluation of the profiling data from transcriptome, proteome and 
metabolome by bioinformatics is also currently in progress. On the basis of the 
assumption that large sets of genes and proteins follow synchronized patterns, attempts 
at taming profiling data have been made using clustering algorithms that group raw data 
in an unbiased way (Eisen, Proc. Natl. Acad. Sci. USA 95 (1998), 14863-14868). 
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Clustering allows genes, proteins or metabolites to be grouped according to their profiles. 
All clustering methods used so far, however, have significant drawbacks (Bittner, Nat. 
Genet. 22 (1999), 213-215) that make them unsuitable for detecting complex 
relationships in data networks. The inability to detect non-linear correlations is one such 
limitation. A new way to cover similarities and correlation of expression profiles based on 
mutual informational entropy has been proposed by Butte (Proc. Natl. Acad. Sci. USA 97 
(2000), 12182-12186). Until recently, all clustering methods also lacked a means by 
which to integrate statistics. Each gene or experiment was mapped into n-dimensional 
expression space without covering the fact that the sharp point within this space is merely 
a probability cloud. Hughes (Cell 102 (2000), 109-126) made an attempt to include 
statistics in their clustering methods so as to see which clusters were trustworthy. In 
addition, profiling bioinformatics is moving towards methods that try to incorporate as 
much available knowledge as possible. Recently, two groups used statistical methods to 
correlate expression profiles with potential promoter sequences (Jensen, Bioinformatics 
16 (2000), 326-333; Tavazoie, Nat. Genet. 22 (1999), 281-285). Marcotte (Nature 402 
(1999), 83-86) tried to elucidate the functions of unknown ORFs on the basis of five 
different data sources, including expression profiling data. Brown (Proc. Natl. Acad. Sci 
USA 97 (2000), 262-267) used a special neuronal network to identify potential gene 
functions on the basis of the expression profile and the transcriptional patterns of well- 
annotated genes. 

Yet, with regard to the goal of comprehensively understanding living organisms, there is 
still an ongoing need for improved data mining tools, and better tools for integrating the 
results of experimentally determined molecular phenotypes with predictions made by 
computational simulations of cellular networks (Fiehn, Comp. Funct. Genom. 2 (2001), 
155-168). 

In view of the above outlines, it is evident that the developments in obtaining 
comprehensive data on the levels of RNA, proteins and metabolites and the attempts to 
derive therefrom new insights into the comptex regulation of gene expression appear to 
be quite promising, however, also require further improvements. 
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Thus, the technical problem underlying the present invention is to provide a method that 
allows it to improve the meaningfulness of correlation data on gene expression and 
metabolite states of a given biological source material. 

This technical problem is solved by the provisions of the embodiments as characterized 
in the claims. 

Accordingly, the present invention relates to a method for providing data useful for 
quantitatively analyzing metabolites, proteins and/or RNA in a biological source material, 
whereby said analysis involves suitable statistical evaluation and correlation analysis on 
the data obtained, preferably also including network analysis, 

said method being characterized by the step of extracting, identifying and quantifying at 
least two compound classes of the group consisting of metabolites, proteins and RNA 
from at least one sample from said biological source material, wherein the compounds of 
said at least two classes are each determined from one sample. 

The particular achievement of the present invention lies in the fact that at least two of the 
three compound classes RNA, proteins and metabolites are each extracted and analyzed 
from one single sample. Accordingly, the phrase "the compounds of said at least two 
classes are each determined from one sample" means that, for each sample collected in 
the course of applying the method of the invention, the compounds of all the compound 
classes analyzed are extracted from one and the same sample. 

The method of the present invention may be carried out altogether taking one sample or 
taking more than one, preferably a multitude of samples. For example, to provide data 
from one sample may be applicable if corresponding reference data is already available 
from other samples, e.g. in a database. Data from a series of samples can for instance be 
useful for investigating a process along a time course, such as a developmental process. 
On the other hand, samples taken from different genotypes of organisms being at the 
same developmental stage may likewise deliver useful data, e.g., for characterizing the 
influence of a particular genotype on gene expression, protein composition and/or 
metabolite composition. 
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The method of the present invention represents a clear improvement over prior art 
techniques for providing data, e.g. for network analyses, since there data on RNA levels, 
protein levels and/or metabolite states were each sampled individually, i.e. determined in 
different samples. Thus, aiming at combining data sets from e.g. two compound classes 
hitherto was hampered by the uncertainty of whether the individual data sets actually 
represent corresponding cellular states. Since cellular states may vary depending on 
developmental or environmental conditions or on the genotype, it is often uncertain 
whether such data sets to be combined are indeed compatible, i.e. so that any 
correlations measured, e.g. between metabolite data and gene expression data, indeed 
reflect the actual processes in the cells under investigation. In addition, prior art attempts 
to combine data from different compound classes may suffer from calibration problems. 
In particular, these occur when one wants to directly set absolute values from one data 
set into relation to absolute values from another data set. To do so would require to have 
control values within each data set that allows it to calibrate the data. These drawbacks, 
i.e. the uncertainty of potential non-compatibility and calibration problems, are largely 
overcome or at least substantially reduced by the provisions of the present invention. By 
deriving the quantitative data on two or three different compound classes each from one 
single sample, it is guaranteed that the data sets obtained reflect just the same cellular 
status of the biological source material. Also calibration should present a decisively 
reduced problem if the data sets are obtained according to the method of the invention. 
Since the amount of material extracted is the same for each compound class analyzed of 
one sample and the data on the individual compound classes are derived from the 
identical cells, it can be expected that absolute values of different compound classes are 
directly correlatable without need of calibration. Still if, due to a certain way of 
subsequently extracting the different compounds from one sample, the absolute values 
finally obtained will not be directly correlatable, the person skilled in the art knows how to 
relate the values obtained to the amount of starting material extracted so that the 
absolute values may resume their direct correlatability. 

The method of the present invention relates to the provision of quantitative data on at 
least two compound classes in a biological source material. The term "compound class" 
relates to a group of compounds which is either metabolites, proteins or RNA. In a 
preferred embodiment, data on all three compound classes are obtained by said method. 
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Also preferred is the combination of metabolites with either proteins or RNA. The term 
"metabolite" refers to any substance within a certain biological source material that is non- 
peptidic and not a nucleic acid molecule. Preferably, the metabolites addressed by the 
present invention rather have a lower molecular weight, i.e. for instance not more than 
4000 Da, preferably not more than 2000 Da, more preferably not more than 1000 Da. 
Typically, the metabolites to be analyzed belong to the following, however non-limiting list 
of compounds: carbohydrates (e.g. sugars, oligo- and polysaccharides such as 
polyglucans as for example starch or polyfructans), sugar alcohols, amines, amino 
alcohols, aliphatics, aliphatic alcohols, amino acids, lipids, fatty acids, fatty alcohols, 
organic acids, organic phosphates, organic or anorganic ions, nucleotides, sugar 
nucleotides, sterols, terpenes, terpenoids, flavons and flavonoids, glucosides, carotenes, 
carotenoids and cofactors. 

The term "proteins" refers to any molecules comprising amino acids connected via 
peptide bonds that are present in the biological source material. 

The term "RNA" refers to any ribonucleic acids that can be present in cells contained in 
the biological source material. Preferred RNA is mRNA, i.e. transcripts from protein- 
encoding genes. The present method may also be applied by detecting other RNA 
species than mRNA such as rRNA, tRNA or viral RNA. 

The term "biological source material" means any material being or containing living 
matter, such as cells, tissues, organs or organisms. The method is not restricted to any 
taxon. Thus, prokaryotes such as archaebacteria or eubacteria (gram-positive or gram- 
negative) as well as eukaryotes such as yeasts, fungi, plants (i.e. algae or land plants, in 
particular flowering plants) or animals as for instance insects or vertebrates, especially 
mammals, and corresponding cell cultures are within its reach. As a preferred biological 
source material, plant tissue, e.g. leaves, can be used. The feasibility of the method of the 
invention has first been demonstrated at Arabidopsis leaf (see Example). Other preferred 
biological source materials may be the classical objects of genetics, such as Drosophila, 
E.coli, yeast, especially Saccharomyces cerevisiae or C. elegans where lots of mutants 
are described and regulatory networks integrating, e.g. metabolites and gene expression 
data, are promising to reveal new insights into gene regulation, e.g. during development. 
In a further important aspect, the method of the invention may help to improve drug 
discovery or cancer research. In this context, corresponding biological source material 
would for example be derivable from animals such as vertebrates, typically birds or 
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mammals, tn connection to this, preferred examples of biological examples may be 
mammalian, preferably human cell lines or tissue from test animals such as laboratory 
mice or rats. 

The term "sample" encompasses any amount of material taken from the biological source 
material that is susceptible to the method of the invention. For instance, a sample can be 
fresh material such as a tissue explant, a body fluid or an aliquot from a bacterial or cell 
culture, preferably deprived of the culture medium, that may be directly subjected to 
extraction. On the other hand, samples may also be stored for a certain time period, 
preferably in a form that prevents destructive action of inherent enzymatic activity, e.g. 
frozen, for instance in liquid nitrogen, or lyophilized. 

Extraction should be carried out so that the compounds one is interested in are dissolved 
as completely and quantitatively as possible and in a manner that they can later be 
identified and quantified using appropriate methods. The term "extracting" thereby refers 
to contacting the material containing the compounds of interest with an extractant (e.g. 
solvent or mixture of solvents) so that these compounds can be dissolved, followed by 
separating the solution from the undissolved matter. The person skilled in the art is 
capable of choosing an appropriate extraction protocol that is suited to isolate the two or 
three compound classes of interest and which is useful for the biological source material 
the sample is taken from. In the prior art, methods are described that may be especially 
useful within the context of the present invention, such as subsequent application of pure 
solvents or methanofwater mixtures, or other methods such as microwave extraction, 
ultrasonication, or accelerated solvent extraction. 

Usually, it is advantageous to use an extractant that contains or is an organic (i.e. 
lipophilic) solvent so that the proteins are denatured upon addition of the extractant so 
that other compounds, i.e. metabolites and/or RNA will not be enzymatically degraded or 
modified. Stopping of inherent enzymatic activity can also be achieved by freeze 
clamping, immediate freezing in liquid nitrogen, or by acidic treatments using perchloric or 
nitric acid (ap Rees and Hill, 1994). Although advantageous for extraction of amines 
(Bouchereau et al. 2000), acidic treatments may possibly pose problems for many 
subsequent analytical methods. In a preferred way, frozen samples can be directly 
extracted by immediately adding organic solvents and applying heat, thereby also 
inhibiting the recovery of enzymatic activity. Extracting frozen samples that still contain 
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the original amount of water can be advantageous for applications including analysis of 
metabolites when compared to extracting lyophilized samples, since lyophilisation may 
potentially lead to the irreversible adsorption of metabolites on cell walls or membranes. 
In cases where it is desired to distinguish between metabolite levels in different 
compartments, samples can be lyophilized prior to non-aqueous fractionation methods 
(Gerhardt and Heldt 1984, Farre et al. 2001). An alternative approach to non-aqueous 
fractionation is the use of nuclear magnetic resonance analyses (NMR) to distinguish 
steady state concentrations of metabolites in different compartments in vivo (Roberts 
2000). For tissue culture, a cold shock can be used in which the liquids are infused into 
cold methanol. All devices needed for further sample preparation should then be kept at 
cold temperatures (Gonzalez et al. 1997). In addition, polar organic solvents like 
methanol, methanol-water mixtures, or ethanol can be directly added to freshly frozen 
tissues (Johansen et al. 1996, Streeter and Strimbu 1998), with an additional step of 
using non-polar solvents such as chloroform to exhaustively extract lipophilic 
components. In order to enhance the extraction efficiency, additional energy may be put 
into the system either directly by heat (e.g. 70°C), or by other techniques such as 
pressurized liquid extraction (Benthin et al. 1999), supercritical fluid extraction (Jarvis and 
Morgan 1997, Blanch et al. 1999, Castioni et al. 1995), sonication (Sargenti and 
Vichnewski 2000), subcritical water extraction (Gamiz-Gracia and de Castro 2000), 
microwave techniques (Namiesnik and Gorecki 2000), or pervaporation (Starmans and 
Nijhuis1996). 

In a preferred embodiment of the method of the invention, extracting comprises the steps 
of: 

(a) extracting the metabolites from said sample with at least one solvent or mixture of 
solvents; 

(b) extracting the proteins from the remainder of the sample after step (a); 

(c) extracting the RNA from the remainder of the sample after step (a); and 

(d) optionally dissolving remaining cellular material contained in said sample. 

This sequence of steps describes the isolation of all three compound classes. If, 
however, one is interested in data of only two compound classes, these steps may be 
adapted correspondingly, for instance by omitting step (a) or (b) or (c), depending on 
whether metabolites, protein or RNA shall not be analyzed. The sequence of steps (a) to 
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(d) is not necessarily chronological. In particular, step (c) may for instance also be carried 
out before step (b) or simultaneously. 

In step (a), the sample is extracted with an extractant that comprises at least a solvent or 
a mixture of solvents. The extractant may contain additional substances such as 
antioxidants, detergents or buffering agents. The term "solvent" refers to the common 
meaning as used in the prior art, while "aqueous solvent" refers to water or an aqueous 
buffer. 

In a particularly preferred embodiment, step (a) is earned out with a mixture of solvents 
that comprises at least one highly polar solvent, at least one less polar solvent and at 
least one lipophilic solvent. 

The terms "highly polaf , "less polar and "lipophilic" in connection with solvent has a clear 
meaning to the person skilled in the art. Usually, the polarity of a solvent is defined by 
reference to the dielectric constant (e) of the solvent. This quantity is temperature- 
dependent, often it is given for solvents at 20°C. In connection with the method of the 
invention, "highly polar" solvents have a dielectric constant in the range from above 35 to 
90, preferably from 36 to 81. "Less polar" solvents have a dielectric constant in the range 
of above 5.5 to 35, preferably from 6 to 33. "Lipophilic" solvents have a dielectric constant 
in the range from 1.0 to 5.5, preferably 1.8 to 5.0. Typical examples of highly polar 
solvents are acetonitrile, N.N-dimethylacetamide, N,N-dimethylformamide, dimethyl 
sulfoxide, glycerol, nitromethane and water. Typical expamples of less polar solvents are 
acetic acid, acetone, 2-bu1anol, 1 ,2-dichloroethane, dichloromethane, ethanol, ethyl 
acetate, methanol, 1-propanol, 2-propanol, pyridine and tetrahydrofuran. Furthermore, 
typical examples of lipophilic solvents are benzene, carbon tetrachloride, chloroform, 
cyclohexane, diethyl ether, 1,4-dioxane, heptane, hexane, pentane, tetrachloroethylene 
and toluene. 

Preferably, the mixture of highly polar, less polar and lipophilic solvents has one phase. 
Accordingly, the solvents should be chosen so that, at the particular conditions of 
extraction applied, the mixture of solvents used has one phase. More preferably, the 
mixture of solvents for use in step (a) comprises water, methanol and chloroform, most 
preferably these solvents are contained in the mixture in the approximate proportion by 
volume of 1:2.5:1 
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In the method of the invention, the temperature under which extraction is carried out can 
be crucial to successfully extract a broad range of compounds, in particular metabolites. 
In the experiments underlying the present invention, a cold temperature has been shown 
to be favorable in this regard. Accordingly, it is preferred that extraction, in particular the 
above-mentioned step (a) of the preferred extraction scheme, is carried out at a 
temperature in the range between -60°C and +4°C, more preferably between -40°C and 
-10°C, still more preferably between -20°C and -13°C and most preferably at -16°C. 
Practically, extraction should be carried out at a temperature where the solvents used are 
liquid. 

Steps (a) to (d) of the above-described preferred embodiment of the method of the 
invention can be carried out according to techniques known to the person skilled in the art 
and described in the literature. In order to illustrate the sequence of the extraction steps, 
Figure 1 shows a specific scheme containing steps (a) to (c). 

Generally, in step (a), the metabolites are extracted from the sample. For this purpose, 
the cells or tissue may be broken up using appropriate techniques such as 
homogenization. After incubating the sample material for sufficient time to dissolve the 
metabolites quantitatively in the extractant, the solution containing the metabolites should 
be separated from undissolved material (remainder of the sample). 
In steps (b) and (c), the proteins and RNA may be extracted from the remainder of the 
sample after step (a), using a suitable extractant, e.g. a buffered aqueous solution. The 
resulting extract can then be subjected to a further separation step, wherein proteins are 
separated from RNA. This can, for instance, be done by phenol extraction, Le. by adding 
phenol, mixing and centrifuging with the result that RNA is in the aqueous phase and 
proteins in the phenol phase. An overview of various RNA isolation techniques is 
described in Sambrook and Russell (2001) and Gassen and Schrimpf (1999). 
Optionally, as step (d), dissolving remaining cellular material contained in the sample may 
also be performed in order to provide additional compounds for quantitative analysis. 
Such undissolved material may comprise membranes or cell wall material that can be 
dissolved using suitable means such as hydrolyzing enzymes. The hydrolysates may for 
instance be added to the metabolite fraction or may be analyzed individually. 
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In a preferred embodiment, the method of the invention further comprises removing 
detection-disturbing compounds from the metabolites, the polypeptides and/or the RNA 
prior to identifying and quantifying the metabolites, proteins and/or RNA. This 
embodiment refers in particular to carbohydrates or other compounds that disturb 
identification and quantification of RNA. Routinely, compounds that may disturb the 
detection of the individual compound classes are removed by suitable techniques known 
to the skilled practitioner if such a removal improves the quality and significance of the 
detection (i.e. identification and quantification). For example, it has been shown that the 
presence of carbohydrates disturbs the detection of RNA by microarrays and that the 
removal of the carbohydrates from the sample may significantly improve the quality of the 
detected signals. 

Identification and quantification of the compounds of interest extracted from the sample 
can be done according to well-known techniques known in the prior art. For each of the 
compound classes in question, techniques are described that allow identification and 
quantification in one step and, moreover, are suited to record the respective compounds 
contained in the extracts in a comprehensive manner. 

For example, metabolites may be identified and quantified using gas 
chromatography/mass spectrometry (GC/MS), liquid chromatography/mass spectrometry 
(LC/MS), NMR or FT-IR or combinations thereof. Further useful methods include LC/UV, 
refractory index determination, the use of radioactivity in connection with suitable 
methods known to the skilled person, thin layer chromatography (TLC), capillary 
electrophoresis (CE), CE/UV, CE/laser induced fluorescence (LIF), fluorescence 
detection, electrochemical detection (i.e. colorimetry), direct injection MS, flow injection 
MS, MS/MS, MS/MS/MS, and further combinations of MS steps (MS n ), fourier transform 
ion mass spectrometry (FT/MS), and gel permeation chromatography (GPC). 
An exemplary non-biased analysis is described in Fiehn et al. (2000). With the aim of 
functionally characterizing plant mutants, detection and relative quantifications of 326 
distinct compounds (ranging from primary polar metabolites to sterols) was carried out for 
both identified and non-identified compounds, after normalization to internal references 
and plant tissue fresh weights. Different plant mutants were compared to the 
corresponding parental genotypic backgrounds, and the data were used for statistical 
analysis as well as for defining metabolic phenotypes that were derived from clustering 
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tools. Another example of GC/MS analyses that can be applied in the method of the 
invention has been described by Roessner et af. (2001), who used ft for comprehensively 
studying the metabolism in potato tubers. Alternatively, metabolite data can be obtained 
by extended chromatographic analysis as described by Tweeddale et al. (1998) where, 
after growing wild type and mutant E.coli strains in minimal media and 14 C-labelled 
glucose, 70 metabolites could be separated using two dimensional thin layer 
chromatography. The relative quantification of metabolites was carried out by radioactive 
detection. 

Suitable techniques for identifying and quantifying proteins are known to the person 
skilled in the art and described in the literature. They include for example LC/MS and two- 
dimensional electrophoresis and protein staining. A comprehensive overview of different 
strategies for protein extraction and proteome analysis from various tissue that may be 
useful in connection with the method of the invention can be found in "2D-Proteome 
Analysis Protocol" by Andrew Link (1999). For example, the use of 2D-gels combined 
with mass spectrometry, usually MALDI-TOF, allows the detection and identification of a 
large number of proteins from a sample in a quality that facilitates the comparison of 
different protein profiles (Nock et al. 1998; Eggeling et al. 1998). For applications that 
require a unbiasedness, resolution and/or reproducibility that exceeds the potential of 
two-dimensional gel electrophoresis (2DE) (e.g. as discussed in Gygi et al. 2000), there 
are alternative techniques available in the prior art. For instance, Wall et al. have shown 
that protein mixtures can be fractionated using isoelectric focusing in the liquid phase 
(Wall et al. 2000, 2001). By coupling to monolithic columns and digesting the protein prior 
to mass spectrometry, it is possible to achieve the identification of hundreds of proteins. 
For the purpose of high throughput approaches relative protein quantification as 
described by Gygi et al. (1999b) may be the technique of choice. When analyzing 
proteins from yeast, Gygi and co-workers were able to achieve a repeatability better than 
12% relative standard deviation. This method involves linking protein Cys-residues to 
stable isotopically labeled chemicals that include biotin moieties for sample purification 
and preconcentration. This method, also referred to as isotope-coded affinity tag (ICAT), 
has been successfully applied in the experiments made in connection with the present 
invention (see Example and Figures 4B and C). Protein identification and quantification 
may likewise be done by de novo peptide sequencing (Goodlett et al. 2001). In addition, 
when applying LC/MS-based methods it may be beneficial to increase the run times of 
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liquid chromatography in order to ensure that only a few peptides per time interval reach 
the mass spectrometer (Washburn et al. 2001). This may be achieved by stepwise 
elution of the peptide mixture from strong cation exchangers onto nanoscale reverse 
phase columns prior to ion trap MS/MS experiments. By applying this strategy, it was 
possible to reach a genome coverage of -25 percent for yeast. In total, 1500 proteins 
were detected, ranging from low abundant transcription factors to proteins with up to 14 
transmembrane domains, and high and low abundant enzymes. Most importantly, it could 
be proven that no bias against codon usage was found in the detected proteins. With 
regard to MS and subsequent MS/MS experiments, it may be recommendable to use 
mass spectrometers with ultimate sensitivity and mass resolution. Here, for example, a 
combination of nanoLC columns (Li et al. 2001, Shen et al. 2001a, 2001b) or capillary 
isoelectric focusing (Jensen et al. 2000) prior to fourier-transform ion cyclotron mass 
spectrometry (FT-MS) can be of use. By applying either of these techniques, up to 10 6 
peptides per run can theoretically be separated, with up to 50,000 peptides found 
experimentally. Another strategy feasible for the method of the invention may be labeling 
approaches using stable isotopes. Such a method was originally developed by Aebersold 
and then adapted to proteomic experiments (Smith et al. 2001, Goshe et al. 2001). Using 
such a method, it was possible to identify and quantify up to a thousand proteins in 4 h 
runs. Moreover, by using infrared laser photodissociation, peptides can be fragmented 
within the cyclotron in order to confirm peptide identification that is for example solely 
based on accurate masses (Conrads et al. 2000). 

RNA can be identified and quantified using a number of techniques currently available in 
the field such as hybridization on nylonfilters, cDNA microarrays, DNA chips loaded with 
oligonucleotides (see Granjeaud, Bioessays 21 (1999), 781-790), serial analysis of gene 
expression (SAGE; Velculescu, Science 270 (1995), 484-487), massively parallel 
signature sequencing (MPSS; Brenner, Nat. Biotechnol. 18(6) (2000), 630-634) 
differential-display-based approaches, such as arbitrarily primed (AP) PCR (Welsh, Nucl. 
Acids Res. 20 (1992), 4965-4970) and cDNA-amplified fragment length polymorphism 
(AFLP) (Bachem, Plant J. 9 (1996), 745-753). Systematic approaches for a 
comprehensive mRNA expression profiling that may be applied in connection with the 
method of the invention are described in Hughes et al. (2000) and Lockhart and Winzeler 
(2000). 
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The term "quantitatively analyzing metabolites, proteins and/or RNA" refers to any 
mathematical analysis method that is suited to further process the quantitative data 
provided by the method of the invention. This data represents the amount of the 
compounds analyzed present in each sample either in absolute terms (e.g. weight or 
moles per weight sample) or in relative terms (i.e. normalized to a certain reference 
quantity). 

Quantitative analysis involves suitable statistical evaluation and correlation analyses. The 
former includes normalization to the total content of the respective compounds within one 
compound class, correction of background levels and the combination of the data sets 
obtained from different compound classes into a single data sheet. Corresponding 
mathematical methods and computer programs are known to the practicioner. Examples 
include SAS, SPSS and systatR. As the next step, the statistically pre-treated data may 
be subjected to a pairwise correlation analysis. Here series of pairs of data points from 
the analyzed compounds are looked at for correlation, whether positive or negative, for 
instance using Pearson's correlation coefficient. In a preferred embodiment, the 
quantitative analysis referred to in the method of the invention furthermore involves 
network analysis. Network analysis aims at finding out higher order interplays of multiple 
factors on the basis of pairwise correlation data. By taking several data sets each 
obtained from one sample, correlations between metabolites, proteins, and/or RNA as 
well as among these classes of compounds can be analyzed in order to derive 
information about the network regulation of biological systems, e.g. upon genetic or 
environmental perturbation. A comprehensive overview of methods for quantitatively 
analyzing data obtained according to the method of the invention including principle 
component analysis, "snapshot analysis", pearson correlation analysis, mutual 
information and network analyses can be found in Fiehn (2001). 

The present invention furthermore relates to a method for quantitatively analyzing 
metabolites, proteins and/or RNA in a biological source material comprising 

(a) providing data on metabolites, proteins and/or RNA in said biological source 
material according to the method described above; 

(b) performing suitable statistical evaluation and correlation analysis on the data 
obtained; and 

(c) optionally further performing a network analysis on the data obtained in step (b). 
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The features of this method have already been described above in connection with the 
method for providing data and likewise apply in the context of this aspect of the invention. 

In addition, the present invention relates to the use of a mixture of solvents comprising at 
least a highly polar solvent, at least a less polar solvent and at least a lipophilic solvent as 
described herein above for extracting metabolites from a sample of a biological source 
material in order to perform metabolite profiling. 

It has been shown that such a mixture of solvents as it is characterized in its various 
aspects above is better suited for extracting metabolites than other extraction methods 
described in the prior art such as subsequent application of pure solvents or 
methanol/water mixtures, or other methods such as microwave extraction, 
ultrasonication, or accelerated solvent extraction. Thus, it is possible to benefit from the 
special extraction strength of such a mixture of solvents for metabolite profiling. The term 
"metabolite profiling" refers to quantitatively analyzing metabolites according to the 
explanations given above. 

According to a preferred embodiment of this use, additionally proteins and/or RNA is 
extracted from said sample. 

These and other embodiments are disclosed and encompassed by the description 
and examples of the present invention. Further literature concerning any one of the 
methods, uses and compounds to be employed in accordance with the present 
invention may be retrieved from public libraries, using for example electronic devices. 
For example the public database "Medline" may be utilized which is available on the 
Internet, for example under http://www.ncbi.nlm.nih.gov/PubMed/medline.html. 
Further databases and addresses, such as http://www.ncbi.nlm.nih.gov/, 
http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, 
http://www.tigr.org/, are known to the person skilled in the art and can also be 
obtained using, e.g., http://www.google.de. An overview of patent information in 
biotechnology and a survey of relevant sources of patent information useful for 
retrospective searching and for current awareness is given in Berks, TIBTECH 12 
(1994), 352-364. 
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Furthermore, the term "and/or" when occurring herein includes the meaning of "and", 
"or" and "all or any other combination of the elements connected by said term". 

The present invention is further described by reference to the following non-limiting 
figures and examples. 

The Figures show: 

Figure 1: Scheme illustrating the steps performed in the experiment of the 
Example. 

Figure 2: A: GC/MS (TOF) direct analysis of hydrophilic and lipophilic metabolites. 
B: LC/MS/MS analysis of the metabolites. 

Figure 3: A: SDS PAGE, lanes 1 to 3 show the proteins from three different samples 
extracted according to the scheme shown in Figure 1 . 
B: Agarose gel electrophoresis, lines 1 to 9 show RNA from nine different 
samples extracted according to the scheme shown in Figure 1 . 

Figure 4: A: Above: LC/MS/MS analysis of peptides obtained upon trypsination of 
protein extracted from Arabidopsis leaf according to the method described. 
A list of the proteins identified in the LC/MS/MS analysis is given in Table 3. 
B and C: ICAT analysis of proteins extracted according to the scheme 
shown in Figure 1 and explained in the Example from an Arabidopsis 
sample and an Arabidopsis control sample showing the relative 
quantification of the proteins (for details see Gygi et al. 1999b). B: list of 
peptides in one fraction after cation-exchange- and avidin-affinity 
chromatography. C: MALDI TOF spectrum with enlarged detail of a peptide 
pair for relative quantification 



Figure 5: 



Metabolite network of Pearson correlations using 30 samples. 
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Figure 6: Analysis of metabolites, proteins, and transcripts extracted from a single 
Arabidopsis leaf sample. 

A Functional characterisation of identified proteins from a single 
Arabidopsis Col2 leaf sample. Majority of the proteins are 
chloroplast-related. 

B Comparison of the integrative extraction protocol with a conventional 
plant RNA extraction kit 

C RNA blot analysis of Arabidopsis isopropylmalate synthase (IPMS) 
transcripts (two isoforms, GI9758 and GI9759360) in three replicate 
samples of Arabidopsis Col2 leafs extracted according to the 
scheme in Figure 1 . 

Figure 7: Principle component and hierarchical cluster analysis of metabolites and 
proteins. 

A Integrated principle component analysis (PCA) of metabolites and 
proteins reveals complete clustering of the two genotypes 

B Individual loadings of the metabolites and proteins for two different 
Arabidopsis varieties C24 and Col2. 

C Hierarchical cluster analysis (HCA) of a merged metabolite-protein 
dataset. (for details see text). Abbreviations of proteins and 
metabolites: RUBISCO activase: ribulose-1,5-bisphosphate 
carboxylase/oxygenase activase (S04048); RUBISCO: ribulose-1,5- 
bispbosphate carboxylase/oxygenase (NP_051 067); GAPDH: 
glyceraldehyde-3-phosphate dehydrogenase (AAD10209); asc 
peroxidase: L-ascorbate peroxidase (S20866); put kinase: protein 
kinase, putative (At3g24550); SUC: sucrose; FUC: fucose; SHI: 
shikimate; CP12 like: CP12 protein precursor-like protein 
(At3g62410); ATPsynthase: ATP synthase CF1 beta chain 
(NP_051066); peroxidase: peroxidase, putative (At3g49120); put 
protein: protein, putative (At3g63190); put TK: transketolase - like 
protein (At3g60750), put aldolase: putative fructose-bisphosphate 
aldolase (AF428455J); put oxidase: glycolate oxidase 
(At3g14420); GST protein: spindly (gibberellin signal transduction 
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protein) (At3g1 1540); ATPase: ATPase alpha subunit (NP_051044); 
EF-1: translation elongation factor eEF-1 alpha chain (gene A4) 
(S08534); catalase: catalase (MB07026); put protein: protein, 
putative (At3g47140); P-protein like: (At4g33010); put protein: 
protein, putative (At3g57190); PS I RC: putative photosystem I 
reaction center subunit II precursor (At1g03130); PS I SUI: 
photosystem I subunit III precursor (CAB52747); CIT: citrate; XYL: 
xylose; TRE: trehalose; SIN: sinapinate; GAL: galacturonate; 
CITMA: citramalate; ASP: aspartate; GLUC: gluconate; MAL: 
malate; INO: inositol; FUM: fumarate. 

The following Examples illustrate the invention: 

Experimental set-up 

Plant material 

Arabidopsis thaliana plants were cultivated in phytotrons under highly controlled light, 
gas, and temperature conditions assuring approximately identical environmental 
conditions for each plant sample. Biological variation among independent samples of the 
same genotypes is attributed to the inherent fluctuation of the biochemical network due to 
slightly changed environments. 

Extraction procedure 

30-100 mg Arabidopsis leaf sample at a developmental stage of 1.1 according to Boyes 
et al. (2001) were harvested and immediately frozen in liquid nitrogen. Tissue was 
homogenized under liquid nitrogen using a retsch mill. A one-phase solvent mixture of 
methanol/chloroform/water 2.5:1:1 (v/v/v) was kept at -20 °C and 2 mL was added to the 
tissue and thoroughly mixed at 4 °C for 30 min to precipitate proteins and DNA/RNA and 
to disassociate metabolites from membrane and cell wall components. After 
centrifugation, the remaining pellet consisting of DNA/RNA, proteins, starch, membranes, 
and cell wall components was extracted in a second step with 1 mL methanol/chloroform 
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1 :1 (v/v) at -20°C. The organic solvent extracts were combined and further used for 
metabolite analysis via GC-TOF. For that purpose the chloroform phase was separated 
from the water/methanol phase by adding 500 [i\ water. The resulting water/methanol 
phase now contained all hydrophilic metabolites such as sugars, amino acids and organic 
acids, and the chloroform phase all the lipophilic compounds, lipids, chlorophyll, and 
waxes. The remaining white pellet was further partitioned according to the scheme in 
Figure 1. The pellet was extracted with 1 ml_ extraction buffer (0,05 M Tris, pH7,6; 0,5 % 
SDS; 1 % 6-Mercaptoethanol) and 1 mL water saturated phenol for 1h at 37°C. After 
centrifugation at 14,000g the remaining pellet was used for cell wall synthesis (data not 
shown). The phenol phase was separated from the buffer phase and the proteins were 
precipitated with ice-cold acetone in -20 °C overnight, washed three times with ethanol 
and dried at room temperature. Remaining protein in the RNA-buffer phase was 
precipitated with 200 uL chloroform. After centrifugation and separation of the buffer 
phase, 40ul_ of acetic acid and 1 mL ethanol were added to precipitate the RNA at 4 °C 
for 30 min. The pellet was washed with one volume 3 M sodium acetate, and two times 
with one volume 70% ethanol. The remaining pellet was dissolved in 100 uL RNAse-free 
water. Amounts and purity of RNA were checked by absorbance at 260 nm and gel 
electrophoresis in agarose. Construction of Arabidopsis isopropyl-malate synthase 
(IPMS) probes for hybridisation and Northern blots were performed using standard 
protocols. 

GC-TOF analysis 

For GC-TOF analysis, the organic phase was dried and dissolved in 50ul of 
methoxamine hydrochloride (20mg/ml_ pyridine) and incubated at 30 °C for 90 min with 
continuous shaking. Then 80 uL of N-Methyl-N-trimethylsilyltrifluoroacetamid (MSTFA) 
was added to derivatize polar functional groups at 37°C for 30 min. The derivatized 
samples were stored at room temperature for 120 min before injection. GC-TOF analysis 
was performed on a HP 5890 gas chromatograph with standard liners and splitless 
injection at 230°C injector temperature. The GC was operated at constant flow of 1 
mUmin Helium and a 40 m 0.25 mm ID 0.25 urn RTX-5 column with 10 m integrated pre- 
column. The temperature gradient started at 80°C, was held isocratic for 2 min, and 
subsequently ramped at 15°C/min to a final temperature of 330°C which was held for 6 
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min. 20 spectra per second were recorded between m/z 85 to 500. After data acquisition 
was finished, reference chromatograms were defined that had a maximum of detected 
peaks over a signal/noise threshold of 20 and used for automated peak identification 
based on mass spectral comparison to a standard NIST 98. Automated assignments of 
unique ions for each individual metabolite were taken as default as quantifiers, and 
manually corrected where necessary. All artifactual peaks caused by column bleeding or 
phtalates and polysiloxanes derived from MSTFA hydrolysation were removed from the 
results table. All data were normalized to plant mg FW and to the internal references and 
log-transformed, t-test, correlation analysis, and variance analysis were performed in 
Excel 5. 

Two-dimensional liquid chromatography/mass spectrometry. 

The dried protein pellet was dissolved in freshly prepared 1 M Urea in 0.05 M Tris buffer 
pH 7.6. The complex protein mixture was digested with modified trypsin (Bohringer 
Mannheim) according to the manufacturer's instructions. The tryptic digest was dried 
down and dissolved in 300 pi water (1% formic acid). Unsoluble material was removed by 
centrifugation. An aliquot of the digest (-100 ug protein) was injected onto two- 
dimensional chromatography on a thermofinnigan proteomeX system coupled to an 
LCQDecaXp ion trap (Thermofinnigan). The chromatographic separation was done 
according to manufacturer's instructions. After a 12 cycle run the MS/MS spectra were 
searched against an Arabidopsis thaliana database (downloaded from the TAIR 
homepage www.arabidopsis.ora ) using Turbosequest implemented in Bioworks 3.0 
(Thermofinnigan). Matches were filtered according to Wolters et al. (2001). Additionally, 
we used the multiple scoring fitter of Bioworks 3.0 with 50 percent ion coverage. For the 
quantification approach aliquots of the complex tryptic digest of Arabidopsis leaf protein ( 
50 uL) were analysed on reversed phase chromatography. Quantification was achieved 
by integrating peak areas of target peptides representative for proteins. These peak 
areas were normalised to the sum of internal standard peptides that had been added to 
the mixture (Chelius et al. (2002), Bondarenko et al. (2002)). 
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Statistical analysis 

All quantitative metabolite and protein data were normalized to FW. Principle component 
analysis (PCA) and hierarchical cluster analysis (HCA) pattern recognition was performed 
according to Fiehn et al. (2000) using Pirouette software (Infometrix, Woodinville, WA). 
The integrative data set of metabolites and proteins was Iog10 transformed. The HCA 
was performed using Euclidian distances and complete linkage grouping. Variance 
analyses were performed in MS Excel 5.0. 

Example 1 : Network analysis of compounds In Arabidopsis leaf 

A sample of an Arabidopsis thaliana leaf (30mg FW) was extracted with a cold (-16°C) 
mixture of chloroform / methanol / water (1 : 2.5 : 1 v/v/v). This mixture is a one-phase 
solution and proved to have an improved extraction strength for metabolites in a 
comprehensive scale. Identification and relative quantitation of these metabolites was 
done with GC/MS and LC/MS analysis (see Figure 2A and 2B). In a subsequent step, 
proteins and RNA were isolated from the remaining cell residue by buffer/phenol 
extraction and phase separation (see Figure 3A and 3B). The protein was precipitated 
from the phenol phase with methanol/acetate, washed three times and dried. After tryptic 
digestion the proteins were identified and relatively quantitated by LC/MS and LC/MS/MS 
(see Figure 4). RNA was precipitated from the aqueous phase with ethanol and 
quantitated by gel electrophoresis (see Figure 3B). 

Last, all data were normalized to the total content of the respective gene products, 
corrected for background levels, and combined in a single data sheet. For each individual 
data set, or for the combined data set, network calculation can be done. 
As an example, all pair-wise correlations for a metabolite dataset of 30 samples were 
analyzed for Pearson's correlation coefficients and visualized in a network (see Figure 5). 

Example 2: Correlation profiling of proteins and metabolites of two Arabidopsis 
genotypes 



For each replicate, 30-100 mg FW leaf tissue of an individual Arabidopsis thaliana plant 
was extracted at 4°C with chloroform/methanol/water (1:2.5:1 v/v/v) according to the 
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scheme illustrated in Figure 1. This single-phase mixture proved to have improved 
extraction strength for metabolites in comparison to the former extraction protocol utilizing 
a methanol/water mixture for 15 min at 70°C. However, when chloroform was left out of 
the cold extraction mix, i.e. if methanol/water extraction mixtures were used at -20°C, a 
strong decrease in sucrose content was detected in subsequent GC/TOF analyses, 
concomitant with a sharp increase in fructose and glucose contents. This indicates that 
chloroform may inhibit sucrose cleaving enzyme activity, e.g. by invertase or sucrose 
synthase, by precipitating these enzymes. Total metabolite analysis was performed with 
GC/TOF (Weckwerth et al., 2001) (see Figure 2A) enabling the detection and 
quantification of 652 metabolites (see Table 1). Replication of metabolite analysis 
revealed a high recovery and a mean coefficient of variance (CV) of 10%. In a 
subsequent step, proteins and mRNA were isolated from the remaining cell residue using 
buffer/phenol extraction and phase separation (see Figure 1). In Figure 6B, a comparison 
of this method with a conventional RNA extraction method is shown. The extraction 
procedure achieves a higher level of RNA recovery than does a typical RNA isolation kit 
extraction (see Experimental set-up, supra) with 30% CV in 28 samples. To test the utility 
of the mRNA for hybridisation we analysed the expression of isopropylmalate synthase 
(IPMS) from Arabidopsis (see Figure 6C). The average amount of total protein extracted 
according to the scheme in Figure 1 was 1.3 mg per 100mg FW with 17% CV. The 
overall extraction process resulted in good recovery of metabolites, proteins, and 
transcripts. After complete extraction, the remaining cell pellet was used for cell wall 
analysis giving rise to clear and typical cell wall profiling (data not shown). 
The protein fraction was analysed using shotgun proteomics (Wolters et al., 2001; 
Washburn et al., 2001 ; Koller et al., 2002). The complex mixture of the tryptic Arabidopsis 
leaf protein digest was analysed via two-dimensional capillary liquid chromatography and 
tandem mass spectrometry on an ion trap mass spectrometer (LCQ Deca Xp Plus) and a 
subsequent database search performed using TurboSequest implemented in 
ThermoFinnigan Bioworks 3.0. In a single Arabidopsis Col2 leaf sample extracted 
according to the scheme in Figure 1 586 peptides and 297 corresponding proteins have 
been identified using very stringent criteria to avoid false positives (see Experimental set- 
up (supra) and Table 2). A classification of the detected proteins from one sample is 
shown in Figure 6A. We applied the integrative extraction process to two Arabidopsis 
genotypes, C24 and Col2, to test if we are able to determine different biochemical 
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phenotypes and general biochemical patterns using this strategy. C24 and Col2 showed 
an overlap of 153 proteins (see Table 2). The data-dependent detection of peptides was 
strongly contingent on the estimated abundance of the corresponding proteins in the 
digest such as RUBISCO (Pang et al., 2002). Thus, the high number of non-overlapping 
proteins also indicates differences in the protein-profiles of these different Arabidopsis 
genotypes. A set of 22 proteins appearing in both varieties was chosen for the 
quantification approach. These proteins were quantified by integrating their 
corresponding peptide areas in a one-dimensional LC/MS analysis. These areas were 
normalized to internal standard peptides as described in Chelius et al. (2002) and 
Bondarenko et al. (2002). The analytical precision was tested by adding internal standard 
peptides to the sample. The deviation of the internal standards - in other words the 
technical variation of the extraction process, stability of electrospray and matrix effects - 
was -25 % CV. Each genotype was represented by ten independent samples. The 
relative integrals of the peptides in each sample were normalised to the fresh weight 
(FW) of the corresponding sample. 

The metabolites in the corresponding samples were identified and quantified with 
GC/TOF. A list of all identified metabolites is given in Table 1. Fourteen of the most 
abundant metabolites were normalized to the FW and combined with the data of the 
quantified proteins to form an integrative dataset. Most important for the integrative 
analysis of such heterogeneous data is the appropriate transformation. A homogeneous 
dataset was achieved by applying Iog10 transformation (see statistical analysis, supra). 
The first step of analysis is to test if we are able to discriminate the two genotypes 
suggesting different biochemical phenotypes under the same environment. We applied 
principle component analysis (PCA) according to Fiehn et al. (2000). Both Col2 and C24 
were completely separated into genotype-clusters (Figure 7A and 7B). In contrast to 
NMR or other fingerprinting methods, the individual identification of compounds by our 
method enables the investigation of distinct metabolite-protein cross-correlations in a 
multitude of samples. The aim is to reveal hierarchical structures within complex 
biochemical networks depending on the genotype-phenotype relationship (Cooper et al., 
2002). Based on the detection of these fundamental correlations in an integrative dataset 
- for instance a distance or a Pearsons Matrix - it is possible to expose instantaneous 
causal connectivities in a regulatory network representing a snapshot of the actual state 
of the system (Arkin et al., 1997; Kell et al., 2000; Vance et al., 2002). To make use of 
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such a refined analysis it is important to differentiate biological variability and technical 
measurement error. The quantified proteins showed an overall variability of -39% 
whereas individual variation was up to 70% exceeding clearly the overall analytical 
precision of -25% CV. The same was observed for the metabolites according to Fiehn et 
al. (2000). We calculated the ratio of standard deviation to the mean for every variable, 
metabolite and protein. These ratios appeared not to be correlated to the means 
(rmetaboiites= 0.38 and rproteins^.23), indicating that the relative variation of these 
compounds does not depend on their abundance. This is indicative of high biological 
variation among independent samples, even samples collected from tissues at seemingly 
identical developmental stage and grown under highly controlled environmental 
conditions. In Figure 7C, a hierarchical analysis of the set of quantified proteins and 
metabolites is shown. C24 and Col2 datasets are merged together to detect biochemical 
patterns conserved for both genotypes. A strongly conserved pattern for both varieties is 
detected for Calvin cycle enzymes such as ribulose-1 ,5-bisphosphate 
carboxylase/oxygenase (RUBISCO) and 3-glyceraldehyde dehydrogenase (GAPDH) 
which is in agreement with the literature (Krapp et al., 1993). Metabolites included in this 
cluster are sucrose (SUC) and fucose (FUC) suggesting the coordination of sucrose 
synthesis and degradation and photosynthetic activity. Surprisingly, ascorbate peroxidase 
is integrated into the Calvin cycle/sucrose cluster giving hints to the connectivity of the 
oxidative state and carbohydrate metabolism in plants. Inside the metabolite cluster, 
biochemically related structures such as malate (MAL) and fumarate (FUM) and 
carbohydrates form subclusters as expected. 
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Table 1 summarizing the metabolites of known structure that were identified 
in the leaf tissue extracts as described in Example 2 



Entrv 


Name 


Sum formula 


Mol. weiqht 


Substance class 




C7TMS 


C10H22O2Si 


202 


fatty acid time standard 


2 


C9TMS 


C12H2602Si 


230 


fatty acid Sme standard 


4 


C11 TMS 


C14H30O2Si 


258 


fatty 3cid time standard 




C13TMS 


C16H3402Si 


286 


fatty acid time standard 


6 


C15TMS 


C18H3802S 




314 


fatty acid time standard 


7 


C19TMS 


C22H4602S 






fatty acid time standard 


8 


C23TMS 


C26H5402S 






fatty acid time standard 


g 


C27TMS 


C30H62O2Si 




fatty acid time standard 


10 


C31 TMS 


C34H70O2Si 


538 


fatty acid time standard 




RIBITOLTMS 






sugar alcohol quantify 
standard 


239 


1HO-14:0TMS 


C17H380Si 


286 


alcohol 


240 


1HO-16:0TMS 


C19H420Si 


314 


alcohol 


241 


1HO-18:0TMS 


C21H460Si 


342 


alcohol 


242 


1HO-20:0TMS 


C23H50OSi 


370 




243 


1HO-22:0TMS 


C25H540Si 






267 


1HO-2&0TMS 


C29H620Si 






274 


1HO26:0TMS 


C29H620Si 






268 


1HO-28CTMS 


C31H660Si 


482 


alcohol 


269 


1HO-29:0TMS 


C32H6SOSi 


496 




270 


1HO30.-0TMS 


C33H7QOSi 


510 




254 


1HO-31:0TMS 


C34H720Si 






271 


1HO-32:0TMS 


C35H740Si 


538 


alcohol ~ 


256 


1HO-33STMS 


C36H760Si 


552 


alcohol 


308 


ETHYLENEGLYCOL,0,0-TMS 


C8H2202Si2 


206 




167 


GLYCEROL 3TMS 


C12H3203Si3 


308 




324 


PROPAN-1 ,2-DIOL,0,0-TMS 


C9H2402Si2 


220 




293 


OXAMIDE.N, N-TMS 


C8H20N2O2Si2 






296 


OXAMIDE,N,N-TMS MEOX 


C9H23N302Si2 


261 


amid 


130 


1 ,3-DIAMINOPROPANE,N,N,N,N-TMS 


C15H42N2Si4 


362 




303 


CADAVERINE,N,N,N,N-TMS 


C17H46N2Si4 






306 


DIETHAN0LAMINE,N,O,O-TMS 


C13H35N02Si3 


321 




307 


DIETHANOLAMINE,0,0-TMS 


C10H27NO2Si2 


249 




331 


ETHANOLAMJNE^l,N,0-TMS 


C11H31NOSi3 






321 


PHENETHYLAMINE, N-TMS 


C14H27NSi2 


265 


amSe 


310 


PUTRESCINE,N,N,N,N-TMS 


C16H44N2Si4 


376 




309 


SPERMIDINE,N,N,N,N,N-TMS 






amine 


186 


2-AMINO-AD1PINIC ACID 3TMS 


C15H35N04Si3 




amino acid 


322 


2-METHYLSERINE,N r O,0-TMS 


C13H33N03Si3 


335 


amino acid 


320 


2-METHYLSERINE,0,0-TMS 


C10H25NO3Si2 






15 


ALANINE,N,0-TMS 


C9H23N02Si2 


233 




36 


ALLOTHREONINE,N,0,0-TMS 


C13H33N03SI3 






22 


ASPARAGINE,N : N,N,0-TMS 


C16H40N2O3Si4 






21 


ASPARAGINE,N,N,0-TMS 


C13H32N203SI3 






126 


ASPART1C ACID,N,0,0-TMS 


C13H31N04Si3 


349 


amino acid 


145 


B-ALANINE TMS 


C12H31N02Si3 


305 


amino acid 


304 


CITRULUNE,0,N,N,N-TMS 


C15H37N303SI3 


391 


amino acid 


127 


CYSTATHIONINE,N,N,0,0-TMS 


C19H46N204SSi4 


510 


amino acid 


132 


CYSTEINE,N.O,S-TMS 


C12H31N02SSI3 


337 


amino acid 


29 


CYSTIN TMS 








168 


GLUTAMIC ACID 3TMS 


C14H33N04SI3 


363 


amino acid 




GLUTAMINE,N,N,N,0-TMS 


C17H42N203Si4 


434 


amino acid 


27 


GLUTAMINE.N,"- O TMS 


C14H34N203Si3 


362 




24 


GLYCINE,N,N,0-TMS 


C11H29N02Si3 


291 


amino acid 


265 


GLYCINE,N,0-1f,ir 


C8H21N02Si2 


219 




30 


HOMOCYSTEINE,N,N,0-TMS 


C13H33N02SSi3 


351 




159 


HOMOGLUTAMINE TMS 










HOMOS ERINE 3TMS 


C13H33N03Si3 


335 




13 


ISOLEUCINE,N,0-TMS 


C12H29N02Si2 


275 


amino acid 


153 


L-ARGININMONOHYDROCHLORID TMS 






amino acid 


12 


LEUCINE, N.O-TMS 


C12H29N02SI2 


275 


amino acid 


33 


L-HYDROXYPROLINE,N,0,0-TMS 


C14H33N03SI3 


347 


amino acid 
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32 


LYSINEJM,N,N',0-TMS 


C13H46N202Si4 


434 


amino acid 


14 


METHIONINE, N.O-TMS 


C11H27N02SSi2 


293 


amino add 


312 


NORLEUCINE,N,N,0-TMS 


C15H37N02SB 


347 


amino acid 


316 


N0RLEUCINE,N,O-TMS 


C12H29N02Si2 


275 


amino add 


317 


NORLEUCINE,0-TMS 


C9H21N02SI 


203 


amino acid 


313 


NORVAUNE,N,N,0-TMS 


C14H35N02Si3 


333 


amino acid 


318 


NORVAUNE,N,0-TMS 


C11H27N02Si2 


261 


amino acid 


319 


NORVAUNE.O-TMS 


C6H19N02Si 


189 


amino acid 


16 


ORNITHINE,N,N,N\0-TMS 


C17H44N202Si4 


420 


amino acid 


291 


OXAM 


CACID,N.O-TMSMEOX1 


C9H22N203SE 


262 


amiioacid 


292 


OXAM 


CACID,N,0-TMSMEOX2 


C9H22N203Si2 


262 


amino acid 


332 


OXAM 


CACID.O-TMSMEOX 


C6H14N203Si 


190 


amino acid 


34 


PHENYLALANINE.N.O-TMS 


C15H27N02Si2 


309 


amiioacid 


314 


PIPECOUC ACID,N,0-TMS 


C12H27N02Si2 


273 


amino acid 


315 


PIPECOLICACID.O-TMS 


C9H19N02SI 


201 


amiioacid 


17 


PROLlNE,N,0-TMS 


C11H25N02Si2 


259 


amino acid 


169 


PYROGLUTAMIC ACID 2TMS 


C11H23N03S12 


273 


amino acid 


197 


SACCHAROPINETMS 






amino acid 


19 


SERINE,N,0,0-TMS 


C12H31N03Si3 


321 


amino acid 


20 


SERINE.O.O-TMS 


C9H23N03Si2 


249 


amino acid 


334 


S-METHYLCYSTEINE 


C16H41N02SSt4 


423 


amino acid 


336 


S-METHYLCYSTEINE,N,N,OTMS 


C13H33N02SSi3 


351 




323 


S-METHYLCYSTEINE,N,0-TMS 


C10H25NO2SS12 


279 


amino acid 


31 


THREONINE,N,0,0-TMS 


C13H33N03Si3 


335 


amino acid 


25 


TRYPTOPHANE,N,N',OTMS 


C20H36N2O2Si3 


420 


amino acid 


26 


TRYPTOPHANE,N,0-TMS 


C17H28N202Si2 


348 


amino acid 


18 


VALINE,N,0-™S 


C11H27N02SI2 


261 


amino acid 


311 


PUTRESCINE + C02 4TMS 


C17H44N202Si4 


420 


artef. of putrescine 


286 


DOPAMINE,N,N,0,0-TMS 


C20H43NO2Si4 




catecholamine 


284 


N0RADRENALINE,N,N,O,O,O-TMS 


C23H51N03Si5 


529 


catecholamine 


285 


NORMETHYLADRENAUNE 4TMS 


C21H45N03SI4 




catecholamine 


333 


TYRAMINE.N.O-TMS 


C14H27NOS2 


281 


catecholamine 


74 


CELLOBIOSE MEOX1 TMS 






disaccharide 


75 


CELLOBIOSE MEOX2 TMS 






disaccharide 


69 


ISOMALTOSE MEOX1 TMS 






disaccharide 


70 


ISOMALTOSE MEOX2 TMS 








189 


LACTOSE MEOX1 TMS 






disaccharide 


190 


LACTOSE MEOX2 TMS 






disaccharide 


71 


LACTULOSE TMS 






disaccharide 


134 


LAMINARIBIOSE MEOX1 TMS 






disaccharide 


135 


LAMINARIBIOSE MEOX2TMS 






disaccharide 


65 


MALTOSE MEOX1 TMS 






disaccharide 


66 


MALTOSE MEOX2 TMS 






disaccharide 


72 


MELIBIOSE MEOX1 TMS 






disaccharide 


73 


MELIBIOSE MEOX2 TMS 






disaccharide 


149 


NIGEROSE MEOX1 TMS 






disaccharide 


150 


NIGEROSE MEOX2 TMS 






disaccharide 


76 


PALATINOSE TMS 






disaccharide 


79 


RAFFINOSETMS 






disaccharide 


68 


SUCROSE TMS 






disaccharide 


67 


TREHALOSE TMS 






disaccharide 


77 


TURANOSE MEOX1 TMS 






disaccharide 


78 


TURANOSE MEOX2TMS 






disaccharide 


194 


XYLOBIOSE MEOX1 TMS 






disaccharide 


195 


XYLOBIOSE MEOX2TMS 






disaccharide 


221 


3HO-10:0METMS 


C14H30O3S 




274 


fatty acid 


220 


3 HO-10:1 ME TMS 


C14H2803S 




272 


fatty acid 


223 


3HO-12:0ME TMS 


C16H3403S 




302 


fatty acid 


222 


3 HO-122 ME TMS 


C16H30O3S 




298 


fatty acid 


224 


3 HO-142 ME TMS 


C18H3403SI 


326 


fatty acid 


230 


3HO-14:3 ME TMS 


C18H3403Si 


326 


fatty acid 


225 


3HO-14:4 ME TMS 


C18H3203S! 


324 


fatty acid 


217 


3HO-6:0METMS 


C10H22O3SI 


218 


fatty acid 


218 


3HO-8:0METMS 


C12H2603Si 


246 


fatty ac'd 


219 


3HO-8:1 ME TMS 


C12H2403Si 


244 


fatty acid 


182 


CIS-1 0-HEPTADECENOIC ACID ME 


C18H3402 


282 


fatty acid 


178 


CIS-1 0-PENTAD ECENOIC ACID ME 


C16H30O2 


254 


fatty acid 


209 


CIS-1 1 ,1 4,17-EICOSATRIENOIC ACID ME 


C21H3602 


320 


fatty acid 


206 


CIS-1 1-EICOSENOIC ACIDME 


C21H40O2 


324 


fatty acid 


213 


CIS-1 3,1 6-DOCOSADIENCHC ACID ME 


C23H42Q2 


350 


fatty acid 
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212 


CIS-13-DOCOSENOIC ACID ME 


C23H4402 


352 


fatty acid 


215 


CIS-15-TETRACOSENOIC ACID ME 


C25H4802 


380 


fatty acicf 


202 


aS-6,9,12-OCTADECATF.!ENO[CACIDME 


C19H3202 


292 


fatty acid 


208 


CIS-8,1 1 ,14-BCOSATRIENOIC ACID ME 


C21H3602 


320 


fatty acid 


204 


CIS-9,1 2,1 5-OCTADECATIENOIC ACID ME 


C19H3202 


292 


fatty acid 


201 


CIS-9,12-OCTADECADIENOIC ACID ME 


C19H3402 


294 


fatty acid 


180 


CIS-9-HEXADECENOIC ACID ME 


C17H3202 


268 


fatty acid 


183 


CIS-9-OCTADECENOIC ACID ME 


C19H3602 


296 


fatty acid 


176 


CIS-9-TETRADECENOIC ACID ME 


C15H28C-2 


240 


fatty acid 


211 


DOCASANOICACID ME 


C23H4602 


354 


fatty acid 


172 


DODECANCHCACID ME 


C13H26Q2 


214 


fatty acid 


205 


BCOSANOICACIDME 


C21H4202 


326 


fatty acid 


210 


HENEICOSANOIC ACID ME 


C22H4402 


340 


fatty acid 


181 


HEPTADECANOIC ACID ME 


C18H3602 


284 


fatty acid 


179 


HEXADECANOIC ACID ME 


C17H3402 


270 


fatty acid 


170 


NOIMADECANOIC ACID ME 


C20H40O2 


312 


fatty acid 


200 


OCTADECANOICACID ME 


C19H3802 


298 


fatty add 


171 


PENTACOSANOICACID ME 


C26H5202 


396 


fatty acid 


177 


PENTADECANOIC ACID ME 


C16H3202 


256 


fatty acid 


216 


TETRACOSANOIC ACID ME 


C25H50O2 


382 


fatty acid 


175 


TETRADECANOIC ACID ME 


C15H30O2 


242 


fatty acid 


203 


TRANS-9,12-OCTADECADIENOIC ACID ME 


C19H34Q2 


294 


fatty acid 


184 


TRANS-9-OCTADECENOIC ACID ME 


C19H3602 


296 




214 


TRICOSANOIC ACID ME 


C24H4802 


368 


fatty acid 


174 


TRIDECANOIC ACID ME 


C14H2802 


228 


fatly acid 


264 


UNDECANOICACID ME 


C12H2402 


200 


fatty acid 


266 


CIS-1 1,14-EICOSADIENOIC ACID ME 


C21H3802 


322 


fatty acid 


273 


3HO-16:1 METMS 


C20H40O3Si 


356 


hydroxy-carboxylic acid 


259 


3HO-165METMS 


C20H38O3Si 


354 


hycfroxy-carboxylic acid 


260 


3HO-163 METMS 


C20H36O3Si 


352 


hydroxy-caiboxylic acid 


272 


3HO-16:0METMS 


C20H42O3Si 


358 


hydroxy-carboxylic acid 


125 


PHOSPHORIC ACID.O.O.O-TMS 


C9H2704PSS3 


314 


inorganic acid 


278 


D-MANNONO-DELTA-LACTAM 4TMS 


C18H43N05SI4 


465 




277 


M ' N( »r ORTA I \C ) \MMEOX4TMS 


C19H46N205SI4 


494 


lactam 


62 


ARABINOSE MEOX1 4TMS 


C18H45N05SI4 


467 


monosaccharide 


63 


ARABINOSE MEOX2 4TMS 


C18H45N05Si4 


467 


monosaccharide 


64 


ARABINOSE MEOX2 4TMS 


C18H45N05Si4 


467 


monosaccharide 


50 


ERYTHROSE MEOX1 3TMS 


C14H35N04Si3 


365 


monosaccharide 


51 


ERYTHROSE MEOX2 TMS 


C14H35N04Si3 


365 


monosaccharide 


46 


FRUCTOSE MEOX1 5TMS 


C22H55NC€Si5 


569 


monosaccharide 


47 


FRUCTOSE MEOX2 5TMS 


C22H55N06S6 


569 


monosaccharide 


58 


FUCOSEMEOX1 4TMS 


C19H47N05SI4 


481 


monosaccharide 


59 


FUCOSE MEOX2 4TMS 


C19H47N05Si4 


481 


monosaccharide 


56 


GALACTOSE MEOX1 TMS 


C22H55N06Si5 


569 


monosaccharide 


57 


GALACTOSE MEOX2 TMS 


C22H55N06SB 


569 


monosaccharide 




GLUCOHEPTULOSE MEOX1 TMS 






monosaccharide 


142 


GLUCOHEPTULOSE MEOX2 TMS 






monosaccharide 


138 


GLUCOSAMINE MEOX1 TMS 






monosaccharide 


137 


GLUCOSAMINE MEOX2 TMS 






monosaccharide 




GLUCOSE MEOX1 5TMS 


C22H55N06SB 


569 


monosaccharide 


42 


GLUCOSE MEOX2 5TMS 


C22H55N06SI5 


569 


monosaccharide 


37 


GLYCERALDEHYD MEOX1 2TMS 


C10H25NO3SI2 


263 


monosaccharide 


38 


GLYCERALDEHYD MEOX2 TMS 


C10H25NO3S12 


263 


monosaccharide 


;Vh 


LYXOSE,0,0,0,0-TMS MEOX1 


Cl8H45N05Si4 


467 


monosaccharide 


280 


LYXOSE,0,0,0,0-TMS MEOX2 


C18H45N05Si4 


467 


monosaccharide 


45 


MANNOSE MEOXTMS 


C22H55N06Si5 


569 


monosaccharide 




RHAMNOSE MEOX1 4TMS 


C19H47N05Si4 


481 


monosaccharide 


61 


RHAMNOSE MEOX2 4TMS 


C19H47NC6Si4 


481 


monosaccharide 


52 


RIBOSE MEOX1 4TMS 


C18H45N05Si4 


467 


monosaccharide 


53 


RIBOSE MEOX2 4TMS 


C18H45N05Si4 


467 


monosaccharide 


54 


XYLOSE MEOX1 4TMS 


C18H45N05Si4 


467 


monosaccharide 


55 


XYLOSE MEOX2 4TMS 


C18H45N05SI4 


467 


monosaccharide 


192 


XYLULOSE MEOX4TMS 


C18H45N05Si4 


467 


monosaccharide 


107 


2-OXOBUTYRIC ACID MEOX1 TMS 


C8H17N03Si 


203 


organic acid 


160 


2-OXOVALERIC ACID MEOX2 TMS 


C9H19N03SI 


217 


organic acid 


120 


I DROXYF RUvIC CII MEOX1TMS 


C10H23NO4Si2 


277 


organic acid 


121 


Hfpnoy.PYnu^L a 1 -idmeox2tms 


C10H23NO4Si2 


277 


organic acid 


122 


3-OXOGLUTARICACID MEOX1 TMS 


C12H25N05SI2 


319 


organic acid 


123 


3-OXOGLUTARlC ACID MEOX2 TMS 


C12H25N05SI2 


319 


organic acid 


166 


4-AMINOBUTYRIC ACID 3TMS 


C13H33N02Si3 


319 


organic acid 
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119 


4-HYDROXYBENZCHC ACIDAO-TMS 


C13H2203Si2 


282 


organic acid 


288 


6-HYDROXYNICOTINIC ACID.O.O-TMS 


C12H21N03Si2 


283 


organic acid 


148 


ACONUICACID 3TMS 


C15H30O6Si3 


390 


organic acid 


109 


ALPHA-KETOGLUTARIC ACID WIEOX1 2TMS 


C12H25N05SI2 


319 


organic acid 


110 


ALPHA-KETOGLUTARIC ACID WIEOX22TMS 


C12H25N05SI2 


319 


organic acid 


99 


ASCORBIC ACID TMS 






organic acid 


101 


BENCOICACIDTMS 


C10H14O2SI 


194 


organic acid 


97 


CITRIC ACID TMS 


C18H40O7Si4 


480 


organic acid 


282 


FERULIC ACIDAO-TMS 


C16H2604Si2 




organic acid 


95 


FUMARIC ACID TMS 


C10H20O4Si2 


260 


organic acid 


111 


GALACTURONIC ACID MEOX1 5TMS 


C22H53N07Si5 


583 


organic acid 


112 


GALACTURONIC ACID MEOX2 5TMS 


C22H53N07SI5 


583 


organic acid 


115 


GLUCARIC ACID 1,4LACTONE MEOX1 TMS 






organic acid 


116 


GLUCARICACID 1.4LACTONE MEOX2TMS 






organic acid 


147 


GLUCOHEPTONIC ACID TMS 






organic acid 


198 


GLUCOHEPTONIC ACID TMS 






organic acid 


114 


GLUCONIC ACID LACTONE TMS 






organic acid 


113 


GLUCONIC AClD,0,O t O,0,0.0-TMS 






organic acid 


187 


GLUCURONIC ACID MEOX1 5TMS 


C22H53N07SI5 


583 


organic acid 


188 


GLUCURONIC ACID MEOX2 5TMS 


C22H53N07Si5 


583 


organic acid 


163 


GLUCURONIC ACID-E-LACTONE MEOX TMS 






organic acid 


117 


GLUTARIC ACID,0,0-TMS 


C11H2404Si2 


276 


organic acid 


118 


GLYCERIC ACID.O.O.O-TMS 


C12H30O4Si3 


322 


organic acid 


330 


GLYCOUC ACIDAO-TMS 


C8H20O3SI2 


220 


organic acid 


162 


GLYOXYUC ACID MEOX TMS 


C6H13N03Si 


175 


organic acid 


262 


GLYOXYUC ACID MEOX TMS 


C6H13N03SI 


175 


organic acid 


263 


GLYOXYLIC ACID MEOX TMS 


C6H13N03Si 


175 


organic acid 


329 


HEXANOICACID.O-TMS 


C9H20O2Si 


188 


organic acid 


124 


HYDROXYBUTANOIC ACIDAO-TMS 


C10H24O3Si2 


248 


organic acid 


165 


INDOLE-3-ACETIC ACID 2TMS 


C16H25N02Si2 


319 


organic acid 


139 


ISOC1TRIC ACID TMS 


C18H40O7Si4 


480 


organic acid 


328 


LACTIC ACID AO VMS 


C9H2203Si2 


234 


organic acid 


152 


LAURIC ACID TMS 


C15H3202SI 


272 


organic acid 




MALIC ACID TMS 


C13H30O5Si3 




organic acid 


261 


NICOTINIC ACID TMS 


C9H13N02SI 


195 


organic acid 


98 


OXALIC ACID TMS 


C8H1804SI2 


234 


organic acid 


105 


PYRUVIC ACID MEOX TMS 


C7H15N03S! 


189 


organic acid 


104 


QUIN1C ACID TMS 


C22H5206Si5 


552 


organic acid 


143 


SACCHARIC ACID TMS 






organic acid 


106 


SALICYLIC ACID 2TMS 


C13H2203SI2 


282 


organic acid 


96 


SHIKIMIC ACID TMS 


C19H4205SI4 


462 


organic acid 


281 


SINAPINIC ACIDAO-TMS 


C17H2805SI2 


368 


organic acid 


133 


SUCCINIC ACID 2TMS 


C10H22O4Si2 


262 


organic acid 


100 
289 


TARTARIC ACID AOAO-TMS 
THREONIC ACID AOAO-TMS 


C16H3806Si4 
C16H40O5SI4 


438 


organic acid 


156 


2-METHYLMALIC ACID 3TMS 


C14H3205Si3 


424 
364 


organic acid 
organic acid 


227 


2-OXOISOCAPROIC ACID MEOX1 TMS 


C10H21NQ3Si 


231 


organic acid 


157 


2-OXOISOCAPROIC ACID MEOX2 TMS 


C10H21NO3Si 


231 


organic ackf 


158 


2-OXOISOVALERIC ACID MEOX1 TMS 


C9H19N03Si 


217 


organic acid 


228 


2-OXOISOVALERIC ACID MEOX2 TMS 


C9H19N03Si 


217 


organic acid 


229 


2-OXOVALERlC ACID MEOX1 TMS 


C9H19N03SI 


217 


organic acid 


131 


3-HYDROXY PYRIDINE ATMS 


C8H13NOSI 


167 


organic compound 




4-HYDROXYPYRIDINE.O-TMS 


C8H13NOS! 


167 


organic compound 


299 


ALLANTOIN,N,N,N,N,N-TMS 


C19H46N403Si5 


518 


orj anic compound 


298 


ALLANTOIN,N,N,N,N-TMS 


C16H38N4Q3Si4 


446 


organic compound 


300 


ALLANTOIN,N,N,N-TMS 


C13H30N4O3SI3 


374 


organic compound 


297 


URICACID,N,N,OATMS 


C17H36N403SI4 


456 


organic compound 


151 


2-PHOSPHOGLYCERATE TMS 


C15H3907PSi4 




phosphorylated 
compound 


39 


3-PHOSPHOGLYCERATE TMS 


C15H3907PSi4 


474 


phosphorylated 
compound 


49 


FRUCTOSE-6-PHOSPHATE MEOX2 TMS 






phosphorylated 


48 


FRUCTOSE-6-PPHOSPHATE MEOX1 TMS 






phosphorylated 
compound 


43 


GLUCOSE-6-PHOSPHATE MEOX1 TMS 






phosphorylated 
compound 


44 


GLUCOSE-6-PHOSPHATE MEOX2 TMS 






phosphorylated 
compound 


128 


GLUCURONIC ACID-6-PHOSPHATETMS 






phosphorylated 
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40 


GLYCERO-3-PHOSPHATE 4TMS 


C15H4106PSi4 


460 


phosphorated 
compound 




GLYCEROL-2-PHOSPHATE O O O O-TMS 


C15H4106PSi4 


460 


phosphorated 
compound 


144 


RIBOSE-5-PHOSPHATE TMS 






phosphorylated 
compound 


196 


SORBITOL-6-PHOSPHATE TMS 






phosphorylated 
compound 


146 
155 


SUCROSE-6-PHOSPHATE TMS 
CYTOSINE 2TMS 






phosphorytated 
compound 


164 


METHYLCYTOSINE 2TMS 


C11H23N30Si2 




pyrimidine 

pyrimidine 


154 


THYMINE 2TMS 


C11H22N202SE 


270 


pyrimidine 


237 


ALPHA-TOCOPHEROL 1TMS ~ 
BETA-SITOSTEROL 1TMS 


C32H5802SI 


502 


steroid 


234 


CAMPESTEROL 1TMS 


C32H580Si 
C31H5SOSi 


472 


steroid 


232 


CHOLESTEROL 1TMS 


C30H54OSi 






233 


LANOSTEROL 1TMS 


C33H5SOSi 


498 


steroid 


236 


STIGMASTEROL 1TMS 


C32H560Si 






88 


ERYTHRITOL TMS 






sugar alcohol 


90 


GALACTITOL TMS 






sugar alcohol 


92 


INOSITOL,0,0,0,0,O,0-TMS 


C24H60O6Si6 




sugar alcohol 


185 


LACTITOL TMS 
MALTITOL TMS 






sugar alcohol 


91 


MALTOTRITOL TMS 






sugar alcohol ■ 


86 
193 


MANNITOLTMS 
ONONITOLTMS 


C2?H5406Si5 




sugar alcohol 
sugar alcohol 


87 
140 


SORBTOL TMS 
XYLITOL 5TMS 


C20H52O5Si5 


512 


sugar alcohol 
sugar alcohol 


84 


CELLOTRIOSE MEOX1 TMS 






trisaccharidG 


85 


CELLOTRIOSE MEOX2 TMS 






trisaccharide 


83 


ISOMALTOTRIOSE TMS 






trisaccharide 


80 


MALTOTRIOSE MEOX1 TMS 






trisaccharide 




MALTOIRIOSE MEOX2 rMS 






trisaccharide 


82 


MELEZfTOSE TMS 






trisaccharide 


275 


1.4-DIDEOXY-1 ,4-IMINO-D-ARABINITOL 3TMS 


C14H35N03Si3 


349 




276 


1 ,4-DIDEOXY-l ,4-IMINO-D-ARABINITOL 3TMS 


C14H35N03Si3 


349 




325 


2-AMINO-2-METHYL-1,3-PROPANDIOL 3TMS 


C13H35N02Si3 


321 




302 


3-METHYLAMIN0L-1,2-PROPAND!0L,N,O,O- 
TMS 


C13H35N02Si3 


321 




301 
290 


3-METHYLAMINOL-1,2-PROPANDIOL,0,0-TMS 
THRE0NINOL,O,O-TMS 


C10H27NO2Si2 
C13H35N02Si3 


249 
321 




199 


UREA 2TMS 


C7H20N2OSi2 


204 
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Table 2 providing a list of the proteins identified in extracts from C24 and 
Col2 leaf tissue as described in Example 2 



Protein ID 


Function 


Peptides 
identified 


Found in 


NP_051067 


ribfose 1 ,5-bisphosphate carboxylase/oxygenase large chain 


23 


Col2/C24 


NP_051066 


ATP synthase CF1 beta chain 


10 


Col2/C24 


S04048 


ribulose-bisphosphate carboxylase activase (EC 6.3.4.-) precursor 


8 


Col2/C24 


NP_051044 


ATPase alpha subunit 


13 


Col2/C24 


AAL38341 


chlorophyll a/b-binding protein 


5 


Col2/C24 


CAB40384 


16 kDa polypeptide of oxygen-evolving complex 


9 


Col2/C24 


At3g01500 


carbonic anhydrase, chloroplast precursor 


8 


Col2/C24 


At3g60750 


transketolase - like protein 


6 


Col2/C24 


At4g10340 


light-harvesting chlorophyll a/b binding protein 


7 


Col2/C24 


GCST_MESCR 


Aminomethyltransferase, mitochondrial precursor (Glycine cleavage 
system T protein) (GCVT) 


7 


Col2/C24 


AAD10209 


glyceraldehyde 3-phosphate dehydrogenase A subunit 


4 


Col2/C24 


At2g30860 


glutathione transferase, putative 


4 


Col2/C24 


At1g06680 


photosystem II oxygen-evolving complex 23 (OEC23) 


5 


Col2/C24 


AT5g25980 


myrosinase TGG2 


5 


Col2/C24 


At5g35630 


glutamate-ammonia ligase (EC 6.3.1.2) precursor, chloroplast 


4 


Col2/C24 


S08534 


translation elongation factor eEF-1 alpha chain (gene A4) 


3 


Col2/C24 


At3g55800 


sedoheptulose-bisphosphatase precursor 


3 


Col2/C24 


At1g03130 


putative photosystem I reaction center subunit II precursor 


4 


Col2/C24 


AAM12979 


chlorophyll a/b-binding protein CP29 


4 


Col2/C24 


At3g26060 


putative peroxiredoxin 


4 


Col2/C24 


NP_051054 


photosystem II protein D2 


3 


Col2/C24 


At2g40840 


glycosyl hydrolase family 77 (4-alpha-glucanotransferase) 


1 


Col2/C24 


BAB08951 


2-cys peroxiredoxin-iike protein 


3 


Col2/C24 


AAM62639 


unknown 


2 


C0I2/C24 


At1g12900 


putative calcium-binding protein, calreticulin 


1 


Col2/C24 


At1g44575 


photosystem II 22kDa protein, putative 


4 


Col2/C24 


At4g05180 


oxygen-evolving complex protein 16, chloroplast precursor (OEC16) 


4 


Col2/C24 


NP_051072 


cytochrome f 


3 


Col2/C24 


AF217459_1 


heat shock protein 70 


2 


Col2/C24 


At1g64290 


hypothetical protein 


1 


Col2/C24 


At5g26000 


glycosyl hydrolase family 1 , myrosinase precursor 


3 


Col2/C24 
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At3g57190 


putative protein 


1 


OOkVO^H 


AAK64040 


unknown protein 






AF428455 1 


putative fructose-bisphosphate aldolase 




UOIZ/(_«i4 


am n^oncn 


phosphoribubkinase precursor 


2 


Col2/C24 


BAA20945 


beta subunit of coupling factor one 




\j\J\dAjCH 


At9rt^Q7^n 

Ai^goy / ou 


auxin-regulated protein 




COI2/C24 


At1g20020 


ferredoxin— NADP reductase precursor, putative 




f^r\\Olf">A 


At3g08940 


putative chlorophyll a/b-binding protein 




C\n\0/f^OA 


NP 051055 


photosystem II 44 kDa protein 




L>ol£/L>£4 


Aioguooou 


expressed protein, supported by cDNA: gi_1 43351 55 


1 


C0I2/C24 


Af>n*tyt'*ftn 


putative retroelement pol polyprotein 




U0liVO24 


Aiog*i04yu 


Non-phototropic hypocotyl, putative 




C0I2/C24 


Aiogooozu 


selenium-binding protein-like 




C0I2/C24 


CAB52747 


photosystem I subunit III precursor 




C0I2/C24 


AAr\Ooo 1 o 


H+-transporting ATP synthase-like protein 




C0I2/C24 


At'^n'Jo^on 
MlogiZOZU 


unknown protein 




C0I2/C24 


AlogOO I4U 


mRNA binding protein precursor - like 




C0I2/C24 


AH nVOH/in 

ax ig/yu4U 


photosystem li polypeptide, putative 




C0I2/C24 


Af^vnOnoQn 
Aiogzu^yu 


putative protein 




C0I2/C24 


ai t g*fy «cyu 


hypothetical protein 




C0I2/C24 


At5g13160 


protein kinase-like 




COI2/C24 


At5g24630 


unknown protein 




C0I2/G24 


mi fyooyyu 


hypothetical protein 




Coi2/C24 




Similar to part of disease resistance protein pmported] 




Col2/C24 


T05822 


hypothetical protein T5K18.170 




Col2/C24 


AloyO i OoU 


potential calcium-transporting ATPase 11, plasma membrane-type 
(Ca2+-ATPase, isoform 11) 




Col2/C24 


At4g25430 


hypothetical protein 




Col2/C24 


At1 g1 6240 


expressed protein 




Col2/C24 


AT [940Z0U 


expressed protein 




Col2/C24 


AAM62447 


glycine-rich RNA binding protein 7 




Col2/C24 


BAB08888 


gene_id:MIJ24.6~ref|NP_013897.1~similarto unknown protein 




Col2/C24 


At3g14420 


glycolate oxidase, putative 




Col2/C24 


AAM65044 


60S acidic ribosomal protein P2 




Coi2/C24 


At2g29450 


glutathione transferase (103-1A) 




Col2/C24 


AAM98072 


unknown protein 




Col2/C24 


AF428455J 


putative fructose-bisphosphate aldolase 




Col2/C24 
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AwgoUti/iU 


photosystem II oxygen-evolving complex 33 (OEC33) 






At3g14415 


glycolate oxidase 




Col2/C24 




expressed protein 




Col2/C24 


At2g1 3360 


alanine-glyoxylate aminotransferase 




UOI<£/0<£4 


At5g42650 


allene oxide synthase 






MlZgUOl uu 


Light-harvesting chlorophyll a/b binding protein 






Aiogoo/ou 


putative protein 




C0I2/C24 


AMg44oyu 


RP19 gene for chloroplast ribosomal protein CL9 




CrAOtf^OA 
U0liifO^4 


At3g45590 


putative protein 




C0I2/C24 


At5g56810 


F-box protein 






At1g69070 


hypothetical protein 




C0I2/C24 


At2g 15325 


hypothetical protein 




C0I2/C24 


At3g63190 


putative protein 




C0I2/C24 


At1g13800 


hypothetical protein 




C0I2/C24 


At3g30843 


hypothetical protein 






S49030 


RNA-bindlng protein RNP-D precursor 




C0I2/C24 


AAM66135 


unknown 




C0I2/C24 


At4g37460 


putative protein 




C0I2/C24 


At5g44870 


disease resistance protein (TIR-NBS-LRR class), putative 




C0I2/C24 


At2g47610 


60S ribosomal protein L7A 




C0I2/C24 


At5g25590 


putative protein 




U0l<i/O«i4 


AAM63618 


putative rubisco subunit binding-protein alpha subunit 




Col2/C24 


At5g23700 


putative protein 




Col2/C24 


NP_051045 


ATP synthase CFO B chain 




Col2/C24 


At3g23400 


expressed protein 




Col2/C24 


At2g40630 


expressed protein 




Col2/C24 


At4g22780 


Translation factor EF-1 alpha - like protein 




Col2/C24 


At1 g04800 


unknown protein 




Col2/C24 


At1 g20060 


kinesin-related protein 




Col2/C24 


At5g60120 


APETALA2 protein - like 




Col2/C24 


At2g01 620 


expressed protein 




Col2/C24 


At2g27000 


cytochrome p450 family 




Col2/C24 


At5g17370 


hypothetical protein 




C0I2/C24 


At5g09660 


microbody NAD-dependent malate dehydrogenase 




Col2/C24 


At4g31050 


putative protein 




COI2/C24 


At2g37310 


hypothetical protein 




COI2/C24 


At5g49120 


putative protein 




C0I2/C24 
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At3g24550 


protein kinase, putative 




Col2/C24 


At2g26940 


putative C2H2-type zinc finger protein 




Col2/C24 


T51531 


biotin carboxyi carrier protein homolog T20K14.140 pmportedj 






At5g16860 


putative protein 




C0I2/C24 


BAB10393 


contains similarity to En/Spm-like transposon 






At4g18820 


putative protein 




COI2/C24 


At5g01730 


putative protein 




Col2/C24 


At1g80910 


myrosinase precursor, putative 




Col2/C24 


At2g05170 


expressed protein 




C0I2/C24 


At5g42920 


putative protein 




l_<OL£/tjii4 


At5g51200 


putative protein 




O0i<£/o^4 


At3g53720 


putative protein 






At5g22450 


putative protein 






At1g22410 


3-deoxy-D-arabino-heptulosonate 7-phosphate, putative 






At4g30990 


putative protein 






At3g20860 


putative serine/threonine protein kinase 




C0I2/C24 


At3g05470 


unknown protein 






At1g06380 


hypothetical protein 




C0I2/C24 


At3g47140 


putative protein 




C0I2/C24 


At5g01630 


putative protein 




C0I2/C24 


At5g39960 


putative protein 




C0I2/C24 


At2g35300 


similar to late embryogenesis abundant proteins 




C0I2/C24 


At4g30830 


putative protein 




CrAOlf^OA 
OOl^O^H 


At1g68940 


hypothetical protein 






At1g79680 


WAK-like kinase (WLK) 




C0I2/C24 


At3g66658 


betaine aldehyde dehydrogenase, putative 




r*r\\oic^r>A 
OOLi/U^-^ 


At4g19320 


hypothetical protein 






At3g49350 


GTPase activating -like protein 




VjOLalv*^ 


At1g16140 


WAK-like kinase (WLK) 




C0I2/C24 


At3g04740 


hypothetical protein 




C0I2/C24 


At3g60890 


putative protein 




C0I2/C24 


At2g07010 


putative retroelement pol polyprotein 




C0I2/C24 


At5g02060 


putative protein 




COI2/C24 


At3g60310 


putative protein 




COI2/C24 


AAA32797 


geranylgeranyl pyrophosphate synthase 




Cot2/C24 


BAB09274 


histidine kinase-like protein 




C0I2/C24 


At1g72500 


hypothetical protein 




COI2/C24 
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At3g42320 


putative protein 


1 


Col2/C24 


H86321 


hypothetical protein F6A14.10 [imported] 


1 


Col2/C24 


At5g58980 


random slug protein - like 


1 


Col2/C24 


At5g14350 


putative protein 


1 


Col2/C24 


At4g27720 


putative protein 


1 


Col2/C24 




L-ascorbate peroxidase (EC 1.11.1.11) precursor 




Col2/C24 


At3g62410 


CP12 protein precursoNike protein 


. 


Col2/C24 


At3g49120 


peroxidase, putative 


1 


Col2/C24 


At3g11540 


spindly (gibberellin signal transduction protein) 


1 


Col2/C24 


AAB07026 


catalase 


3 


Col2/C24 


At4g33010 


P-Protein - like protein 


3 


Col2/C24 


RBS1_ARATH 


Ribulose bisphosphate carboxylase small chain 1 A, chloroplast 
precursor (RuBisCO small subunit 1A) 


6 


Col2/C24 


At3g45140 


lipoxygenase AtLOX2 


7 


Col2 


S11852 


photosystem II oxygen-evolving complex protein 1 precursor 


8 


Col2 


At5g54270 


light-harvesting chlorophyll a/b binding protein, putative 


3 


Col2 


JQ1286 


glyceraldehyde-3-phosphate dehydrogenase (NADP) 
(phosphorylating) (EC 1.2.1.13) B precursor, chloroplast 


8 


Col2 


At1g56190 


phosphoglycerate kinase, putative 


7 


Col2 


At3g04120 


glyceraldehyde-3-phosphate dehydrogenase C subunit (GapC) 


3 


Col2 


At2g02930 


glutathione transferase, putative 


5 


Col2 


AAA50156 


carbonic anhydrase 


5 


Col2 


AT4g37930 


glycine hydroxymethyltransferase-like protein 


5 


Col2 


At4g04640 


coded for by A. thaliana cDNA AA041 141 


5 


Col2 


T52072 


hypothetical protein g5bf [imported] 


5 


Col2 


NP_051058 


photosystem I P700 apoprotein A2 


5 


Col2 


AT3g48870 


AtClpC endopeptidase Clp ATP-binding chain C 


5 


Col2 


T12970 


hypothetical protein T6H20.190 


3 


Col2 


A96602 


elongation factor EF-2 [imported] 


5 


Col2 


At2g39730 


Rubisco activase 


3 


Col2 


CAA70862 


ferredoxin-dependent glutamate synthase 


4 


Col2 


AF326861 1 


putative photosystem I subunit PSI-E 






AAN31836 


putative 5-methyltetrahydropteroyltriglutamate-homocysteine S- 
methyltransferase 


4 


Col2 


AT4g38970 


putative fructose-bisphosphate aldolase 


2 


Col2 


T52314 


chlorophyll a/b-binding protein Lhcb6 pmported] 


4 


Col2 


At2g35370 


glycine decarboxylase complex H-protein 


2 


Col2 
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NP051084 


photosystem II 47 kDa protein 


2 


Col2 


At4g13400 


putative protein 


1 


Col2 


AAN31832 


putative chloroplast translation elongation factor EF-Tu precursor 


3 


Col2 


At5g38410 


ribulose bisphosphate carboxylase small chain 3b precursor 


3 


Col2 


AAA32813 


plasma membrane proton pump H+ ATPase 


3 


Col2 


NP_051039 


photosystem II protein D1 


3 


Col2 


Al4g18480 


protein ch-42 precursor, chloroplast 


3 


Col2 


S16582 


fructose-bisphosphatase (EC 3.1.3.1 1) precursor, chloroplast 


4 


Col2 


At5g47210 


putative protein 


2 


Col2 


JT0901 


chaperonin 60 beta precursor 


3 


Col2 


CAA74895 


ribosomal protein L4 


1 


Col2 


At1g76030 


vacuolar ATP synthase subunit B 


2 


Col2 


S33707 


DNA-damage repair protein DRT1 12 precursor 


1 


Col2 


AAN31859 


putative heat shock protein 81-2 (HSP81-2) 


3 


Col2 


AAM63250 


cyanate lyase 


1 


Col2 


AF360195_1 


putative alanine aminotransferase 


2 


Col2 


At1g53310 


phosphoenolpyruvate carboxylase 1, putative 


2 


Col2 


At2g26080 


putative glycine dehydrogenase 


2 


Col2 


At1g04410 


putative malate dehydrogenase 


2 


Col2 


At1g50250 


chloroplast FtsH protease 


2 


Col2 


G84888 


probable transketolase precursor [imported] 


2 


Col2 


AAK73957 


putative ftsH chloroplast protease 


3 


Col2 


AF083913_1 


annexin 


2 


Col2 


At1g77160 


hypothetical protein 


1 


Col2 


At3g17820 


glutamine synthetase, putative 


2 


Col2 


C84582 


hypothetical protein At2g19880 [imported] 


1 


Col2 


At2g41560 


potential calcium-transporting ATPase 4, plasma membrane-type 
(Ca2+-ATPase, isoform 4) 


2 


Col2 


AAM62764 


glutamine synthetase, putative 


2 


Col2 


At1g24706 


F5A9 .21, unknown 


1 


Col2 


At2g22010 


unknown protein 


1 


Col2 


At5g14740 


carbonic anhydrase 2 




Col2 


At1g76180 


dehydrin, putative 


2 


Col2 


AF428455_1 


putative fructose-bisphosphate aldolase 


1 


Col2 


At4g31700 


ribosomal protein S6 - like 


1 


Col2 


At1g74060 


putative 60S ribosomal protein L6 


1 


Cot2 


At3g47070 


putative protein 


2 


Col2 
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At3g58730 


v-ATPase subunit D (vATPD) 


2 


Col2 


At1g23740 


putative auxin-induced protein 




Col2 


CAA11554 


2-oxoglutarate dehydrogenase, E3 subunit 


2 


Col2 


At2g21170 


putative triosephosphate isomerase 


2 


Col2 


At3g46520 


actin12 


1 


Col2 


At1g19570 


dehydroascorbate reductase, putative 


2 


Col2 


AAM97062 


unknown protein 


1 


Col2 


S71112 


catalase (EC 1.11.1.6) 3, peroxisome/glyoxysome location signal 
(S4RKH]-L) motif 


1 


Col2 


At4g13940 


adenosylhomocysteinase 


2 


Col2 


NP_051087 


photosystem II phosphoprotein 


1 


Col2 


At2g15620 


ferredoxin-nitrite reductase 


1 


Col2 


At3g47470 


light-harvesting chlorophyll a/b binding protein 


1 


Col2 


At4g35090 


catalase 2 


1 


Col2 


S19226 


cold-regulated protein cor47 


2 


Col2 


CAC35872 


H+-transporb"ng ATP synthase beta chain (mitochondrial)-like 
protein 


2 


Col2 


AAB80700 


glycolate oxidase 


1 


Col2 


At4g09000 


14-3-3 protein GF14 chi (grfl) 


1 


Col2 


At2g34430 


photosystem II type I chlorophyll a lb binding protein 


1 


Col2 


At4g34870 


peptidylprolyl isomerase (cyclophilin) 


2 


Col2 


At1g16880 


expressed protein 


2 


Goi2 


At2g37660 


expressed protein 


2 


Col2 


AF36Q228_1 


putative glutathione reductase 


1 


Col2 


At3g49910 


60S ribosomal protein - like 


2 


Col2 


At5g15980 


putative protein 


2 


Col2 


T52122 


chaperonin 10 


2 


Col2 


At3g07570 


unknown protein 


1 


Col2 


AAM13161 


ATP-dependent transmembrane transporter, putative 


1 


Col2 


AAB09585 


ADP glucose pyrophosphorylase small subunit 


2 


Col2 


At3g02090 


putative mitochondrial processing peptidase 


2 


Col2 


At4g03430 


putative pre-mRNA splicing factor 






At1g71240 


hypothetical protein 


1 


Col2 


At4g31700 


ribosomal protein S6 - like 


1 


Col2 


AAK59424 


putative DEF (CLA1) protein 


1 


Col2 


At5g55180 


glycosyl hydrolase family 17 


1 


Col2 


At1g74770 


hypothetical protein 


1 


Col2 
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AC012394J7 


putative phytochrome A signaling protein 




Col2 


At2g24820 


putative Rieske iron-sulfur protein 




Col2 


At2g36380 


ABC transporter family protein 




Col2 


At2g35120 


glycine decarboxylase complex H-protein 




Col2 


AAL32516 


putative protein 




Col2 


At1g50730 


hypothetical protein 




Col2 


At2g14470 


putative helicase 




Col2 


At1g67560 


putative lipoxygenase 




Col2 


At1g56190 


phosphoglycerate kinase 




Col2 


At4g12180 


putative reverse transcriptase 




Col2 


At3g51560 


disease resistance protein (TIR-NBS-LRR class), putative 




Col2 


At1g50120 


hypothetical protein 




Col2 


G86301 


probable retroelement polyprotein [imported] 




Cot? 


At5g16500 


protein kinase-like protein 




Col2 


At1g60860 


GCN4-complementing protein 




Col2 


CAB80674 


putative protein transport factor 




Col2 


At2g34610 


hypothetical protein 




Col2 


T01733 


hypothetical protein AJG002N01 .31 




Col2 


At2g07698 


hypothetical protein 




Col2 


At5g10790 


ubiquitin-specific protease 22 (UBP22) 




Col2 


At1g62810 


amine oxidase, putative 




Col2 


AC069473_9 


unknown protein 




Col2 


At1g73980 


unknown protein 




Col2 


T50928 


calmodulin-binding protein pmportedj 




Col2 


AAM62795 


60S ribosomal protein L27A 




Col2 


At5g37670 


Low-molecular-weight heat shock protein - like 




Col2 


At5g48010 


pentacyclic triterpene synthase (04C1 1) (ATPEN1), putative 




Col2 


At2g43560 


FKBP-type peptldyl-prolyl cis-trans isomerase 




Col2 


AC007354J0 


Strong similarity to gb|Y09533 involved in starch metabolism from 
Solanum tuberosum 




Col2 


At1g13790 


hypothetical protein 




Col2 


At2g19380 


RRM-containing RNA-binding protein 






At5g06240 


unknown protein 




Col2 


At1g67240 


mutator-like transposase, putative 




Col2 


CAA69802 


ATPase subunit 1 




Col2 


At4g20890 


tubulin beta-9 chain 




Col2 


At2g40590 


40S ribosomal protein S26 




Col2 
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BAA97188 


emb|CAB87273.1~geneJd:MMI9.7~similar to unknown protein 




Col2 


At5g09860 


expressed protein 




Col2 


At1g60630 


leucine-rich repeat transmembrane protein kinase 




Col2 


At1g02500 


s-adenosylmethionine synthetase 




Col2 


AF462865J 


unknown protein 




Col2 


AF424618J 


membrane-associated salt-inducible protein 




Col2 


At1g05530 


UDP-glycosyltransferase f am j|y 




Col2 


At1g74680 


Exostosin family 




Col2 


At1g32470 


glycine cleavage system H protein precursor, putative 




Col2 


At2g47470 


putative protein disulfide-isomerase 




Col2 


T48997 


epsin-like protein 




Col2 


AAK96795 


acyl carrier protein (ACP) gene 




Col2 


At2g16890 


putative glucosyltransferase 




Col2 


AAL91646 


unknown protein 




Col2 


At5g59660 


leucine-rich repeat transmembrane protein kinase, putative 




Col2 


At5g66190 


ferredoxin-NADP+ reductase 




C24 


At3g22910 


potential calcium-transporting ATPase 13, plasma membrane-type 
(Ca2+-ATPase, isoform 13) 




C24 


At4g28750 


photosystem I subun'rt PSI-E - like protein 




C24 


At5g48310 


putative protein 




C24 


At5g28300 


GTL1 - like protein 




C24 


At1g29930 


light-harvesting chlorophyll a/b binding protein 




C24 


At5g09660 


microbody NAD-dependent malate dehydrogenase 




C24 


At3g11820 


syntaxin SYP121 




C24 


At5g40480 


nuclear pore protein -like 




C24 


At2g35920 


putative ATP-dependent RNA helicase A 




C24 


At1g22490 


expressed protein 




C24 


At5g14070 


glutaredoxin-like protein 




C24 


NP_051048 


ribosomal protein S2 




C24 


At4g19750 


grycosyl hydrolase family 18 




C24 


At1g25340 


myb-related transcription factor (cpm7), putative 




C24 


At2g25140 


HSP100/ClpB, putative 




C24 


At2g20960 


pEARLt 4 protein 




C24 


At2g27480 


putative calcium binding protein 




C24 


AAD03443 


contains similarity to human RNA polymerase II complex 
component SRB7 (GB:U52960) 




C24 


At5g03940 


signal recognition particle 54CP protein precursor 


1 


C24 
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At1g07430 


protein phosphatase 2C (PP2C), putative 




C24 


At4g07960 


putative glucosyltransferase 




C24 


At3g11630 


putative 2-cys peroxiredoxin BAS1 precursor (thiot-specific 
antioxidant protein) 




C24 


At4g01310 


putative L5 ribosomal protein 




C24 


At5g61250 


glycosyl hydrolase family 79 (endo-beta-glucuronidase/heparanase) 




C24 


At1g65010 


hypothetical protein 




C24 


At1g67810 


unknown protein 




C24 


At5g50260 


cysteine proteinase 




C24 


At5g64040 


photosystem I reaction center subunit PSI-N precursor (PSI-N) 




C24 


At5g24770 


vegetative storage protein Vsp2 




C24 


At4g28630 


ABC transporter family protein 




C24 


At5g09730 


glycosyl hydrolase family 3 




C24 


At4g31300 


20S proteasome beta subunit A (PBA1); 




C24 


T05498 


hypothetical protein T19K4. 190 




C24 


AC000103_3 


unknown protein 




C24 


At5g09700 


beta-glucosidase - like protein 




C24 


At5g15200 


40S ribosomal protein - like 




C24 


At1g72300 


leucine-rich repeat transmembrane protein kinase, putative 




C24 


At3g14350 


leucine-rich repeat transmembrane protein kinase, putative 




C24 


At3g13160 


expressed protein 




C24 


At5g59660 


leucine-rich repeat transmembrane protein kinase, putative 




C24 


At3g05400 


sugar transporter, putative 




C24 


At5g54290 


cytochrome c biogenesis protein precursor (gb|AAF35369.1) 




C24 


At1g75350 


chloroplast50S ribosomal protein L31, putative 




C24 


At1g75350 


chloroplast 50S ribosomal protein L31, putative 




C24 


At3g54890 


light-harvesting chlorophyll a/b binding protein 




C24 


At5g47180 


VAMP (vesicle-associated membrane protein)-associated protein- 
like 




C24 


At5g41610 


Na+/H+ antiporter-like protein 




G24 


AC002423_15 


unknown protein 




C24 


At5g58490 


cinnamoyl-CoA reductase - like protein 




C24 


C86379 


unknown protein 




C24 


At1g19640 


S-adenosyl-L-methionine:jasmonic acid carboxyl methyltransferase 
(JMT) 




C24 


At5g04290 


glycine-rich protein 




C24 


At1g35680 


50S ribosomal protein L21 chloroplast precursor (CL21) 




C24 
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NP_051097 


ribosomal protein L22 




C24 


At5g48600 


chromosome condensation protein 




C24 


At3g43190 


sucrose synthase, putative 




C24 


At3g26790 


transcriptional regulator (FUSCA3) 




C24 


At4g17300 


asparagine-tRNA ligase 




C24 


At1g31000 


hypothetical protein 




C24 


BAB02913 


unknown protein 




C24 


AAK64154 


unknown protein 




C24 


AAB61690 


disease resistance protein homolog 




C24 


At2g24490 


putative replication protein A1 




C24 


At5g66190 


ferredoxin-NADP+ reductase 




C24 


At3g22910 


potential calcium-transporting ATPase 13, plasma membrane-type 
(Ca2+-ATPase, isoform 13) 




C24 


At5g48310 


putative protein 




C24 


At5g28300 


GTL1 - like protein 




C24 


At1g29930 


light-harvesting chlorophyll a/b binding protein 




C24 


At5g09660 


microbody NAD-dependent malate dehydrogenase 




C24 


At3g11820 


syntaxin SYP121 




C24 


At3g51570 


disease resistance protein (TIR-NBS-LRR class), putative 




C24 


At3g46530 


disease resistance protein, RPP13-like (CC-NBS class), putative 




C24 


At2g25710 


biotin hoiocarboxylase synthetase 




C24 


At2g12150 


Mutator-like transposase 




C24 


At5g32481 


Athila retroelement ORF1, putative 




C24 


At3g24190 


expressed protein 




C24 


At1g19390 


WAK-like kinase (WLK) 




C24 


At4g22470 


extensin - like protein 




C24 


At1g47560 


hypothetical protein 




C24 


At2g16780 


putative WD-40 repeat protein, MSI2 




C24 


At1g18030 


protein phosphatase 2C (PP2C), putative 




C24 


At3g23790 


AMP-binding protein, putative 




C24 


At1g66530 


arginyl-tRNA synthetase 




C24 


At1g48150 


MADS-box protein 






At4g14140 


(cytosine-5-)-methyltransferase 




C24 


At1g62940 


4-coumarate:ccenzyme A ligase, putative 




C24 


At2g29500 


putative small heat shock protein 




C24 


At3g07980 


putative MAP3K epsilon protein kinase 




C24 


At4g23940 


putative MAP3K epsilon protein kinase 




C24 
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At1g21810 


myosin-like protein 




C24 


At5g14950 


glycosyl hydrolase family 38 (alpha-mannosidase) 




C24 


At3g53280 


cytochrome P450 monooxygenase 




C24 


At2g41310 


putative two-component response regulator 3 protein 




C24 


At1g50410 


DNA-binding protein, putative 




C24 


At5g05340 


peroxidase, putative 




C24 


A96721 


probable peptide transporter 




C24 


At2g26790 


putative salt-inducible protein 




C24 


At3g30570 


putative reverse transcriptase 






At1g74080 


putative transcription factor 




C24 


At3g21210 


CHP-rich zinc finger protein, putative 




C24 


At2g42270 


U5 small nuclear ribonucleoprotein helicase, putative 




C24 


At1g69320 


CLE10, putative 




C24 


At3g42950 


polygalacturonase, putative 




C24 
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Table 3: List of the proteins identified in the LC/MS/MS analysis as depicted in 
Figure 4A and via database search 



6.57 
e+011 


26/112 
(23%) 


96024.5/ 
8.27 


>gi|15234315|ref|NP_192920.1| putative phospholipase D- 
gamma [Arabidopsis thaliana] 


3.05 
e+011 


27/112 
(24%) 


92351.2/ 
6.20 


>gi|15240282|ref|NP_200964.1 1 putative protein [Arabidopsis 
thaliana] 


1.52 
e+011 


28/112 
(25%) 


92718.8/ 
5.43 


>gi|1 5242062|ref|NP_1 97578.1 1 putative protein [Arabidopsis 
thaliana] 


2.24 
e+010 


24/112 
(21%) 


86209.1 / 
6.18 


(AB01 2239) contains similarity to receptor-like protein 
kinase~gene_id:K1 1J9.9 


1.82 

e+010 


22/112 
(19%) 


73879.2/ 
8.66 


>gi|4220445|gb|AAD1 2672.1 1 (AC006216) Similar to 
gi|3004555 F19F24.14 salt inducible protein homolog from 
Arabidopsis thaliana BAC gb|AC003673 


e+009 


(iSI) 


ou i oo.o / 
5.98 


•^niM ExOOA ^OHIroflMP 1 Tfi^lAO 1 1 amino rvvirlaeo ni itatix/a 
>yl| I Oc-C. IOi7U|lci jl^ir I /OH-oy. 1 1 dlllH ie OXIUdofc?, pUldllVc 

[Arabidopsis thaliana] 


e+009 


(16%) 


AA-tQO A 1 

9.25 


(AC013258) unknown protein; 14-1201 


e+009 


(23%) 


67135.1 / 
6.21 


>gi|io^<ioo4o|reT|Nr_iyuoyo.i[ ceil division cycie protein zd 
homolog [Arabidopsis thaliana] 


e+009 


(15%) 


ronnc e: / 

6.24 


>gi|io^ou44/|ret|Nf J _iyu/ou.i 1 putative protein [Arabidopsis 
thaliana] 


e+009 


(21%) 


yam a c; / 
/yu i h.o / 

7.84 


>gi| i o^o^^:o|reT|i>ir_iy<i roo. i| pmaiive cuiim-iiKe i protein 
[Arabidopsis thaliana] 


e+009 


(25%) 


Oi?UO / .y / 

8.65 


>yi| i o^:ocw-yo|rci|i>ir_ i \3£. so^. 1 1 [lypoineiioai proiem 
[Arabidopsis thaliana] 


e+009 


(14%) 


QQAQ-i Q I 

oy4o i .» / 
5.78 


>gi|io£ii:obyo|ret|NP_ioit>«;*;.i| putative isoamyiase 
[Arabidopsis thaliana] 


e+009 


(22%) 


761 15.8/ 
9.03 


>gi| lOii ly^/ i|reT|iMr_i /o/o/. i 1 nypotneticai protein 
Arabidopsis thaliana] 


e+009 


(22%) 


OOOOU.D / 

9.10 


(AC001229) Similar to Arabidopsis cytochrome P450 CYP90 
(gb|X87367). 


1.12 
e+009 


25/112 
(22%) 


551 28.6 / 
8.60 


>gi|1 521 8776|ref|NP 1 76744.1 1 cytochrome P450, putative 
[Arabidopsis thalianaj 


1.08 
e+009 


21/112 
(18%) 


34622.9/ 
8.50 


>gi|15241862|ref|NP_201058.1| putative protein [Arabidopsis 
thaliana] 


9.98 
e+008 


23/112 
(20%) 


70809.8/ 
9.19 


>gi|15228449|ref|NP_1 86956.1 1 putative 26S proteosome 
regulatory subunit [Arabidopsis thaliana] 


9.57 
e+008 


25/112 
(22%) 


91655.7/ 
9.43 


>gi|1 5242321 |ref|NP_1 96478.1 1 RNA helicase-like protein 
Arabidopsis thaliana] 


7.44 
e+008 


18/112 
(16%) 


50284.9/ 
6.58 


>gi|15228443|ref|NP_1 89791 .1 1 vacuolar H(+)-ATPase 
subunit-like protein [Arabidopsis thaliana] 


7.43 
e+008 


19/112 
(16%) 


50357.9/ 
6.58 


(AY0371 76) AT3g42050/F4M1 9_1 0 
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7.14 
e+008 


17/112 
(15%) 


59440.9 / 
9.67 


(AC008046) Hypothetical protein 


6.34 
e+008 


17/112 
(15%) 


94851.5/ 
6.35 


(AF364174) chromomethylase 3 


5.61 
e+008 


26/112 
(23%) 


79480.6/ 
6.49 


>gi|15222507|ref|NP_1 76554.1 1 unknown protein [Arabidopsis 
thaliana] 


4.8 
e+008 


22/112 
(19%) 


95794.4/ 
6.19 


(AC007980) Very similar to disease resistance proteins 


4.5 
e+008 


13/112 
(11%) 


77293.1 / 
8.80 


>gi|1 52401 14jref|NP_1 98530.1 1 senl-like protein [Arabidopsis 
thaliana] 


4.47 
e+008 


21/112 
(18%) 


95834.2 / 
8.32 


>gi|15237673|ref|NP„200654.1| putative protein [Arabidopsis 
thaliana] 


4.44 
e+008 


27/1 12 
(24%) 


91155.0/ 
8.56 


(AJ299417) hypothetical protein 


4.44 
e+008 


13/112 
(11%) 


78390.5 / 
8.70 


(AB01 7068) contains similarity to nonsense-mediated mRNA 
decay trans-acting factors~gene_id:MJG14.20 


3.9 
e+008 


20/112 
(17%) 


97275.5 / 
6.18 


>gi[15222893|ref|NP_1 75437.1 1 disease resistance protein, 
putative [Arabidopsis thaliana] 


2.82 
e+008 


21/112 
(18%) 


63750.1 / 
6.15 


(AY056217) putative cell division cycle protein 23 


2.57 
e+008 


24/112 
(21%) 


73831.6/ 
6.43 


(AF281 655) zeaxanthin epoxidase 


2.57 
e+008 


24/112 
(21%) 


73842.7 / 
6.60 


>gi|1 52401 69|ref|NPj201504.1| zeaxanthin epoxidase 
precursor [Arabidopsis thaliana] 


2.56 
e+008 


29/112 
(25%) 


92376.5 / 
6.53 


>gi[15232837[ref|NP_1 86850.1 1 hypothetical protein 
[Arabidopsis thaliana] 


2.52 
e+008 


20/112 
(17%) 


92074.5 / 
6.60 


(AF277982) origin recognition complex 1 


2.13 
e+008 


20/1 12 
(17%) 


97220.8 / 
6.89 


(AB052756) SRKb 


1.93 
e+008 


22/112 
(19%) 


73984.2 / 
7.50 


>gi|15221081|ref|NP_1 72636.1 1 putative salt-inducible protein 
[Arabidopsis thaliana] 


1.91 
e+008 


20/1 12 
(17%) 


90361.3/ 
6.01 


>gi|15225790|ref|NP_1 80247.1 1 putative salt-inducible protein 
[Arabidopsis thaliana] 


1.79 
e+008 


26/112 
(23%) 


73268.2 / 
5.11 


>gi|15229907|ref|NP 1 87164.1 1 hypothetical protein 
[Arabidopsis thaliana] 


1.77 
e+008 


15/112 
(13%) 


80208.1 / 
5.72 


>gi|15241843|ref|NP 198205.1 1 far-red impaired response 
protein (FAR1) - like [Arabidopsis thaliana] 


1.73 
e+008 


19/112 
(16%) 


81148.9/ 
8.74 


(AP000377) long-chairvfatty-acid CoA ligase 


1.73 
e+008 


25/112 
(22%) 


79706.0/ 
7.19 


>gi|15223594|ref|NP_1 76062.1 1 hypothetical protein 
Arabidopsis thaliana] 


1.7 
e+008 


18/112 
(16%) 


63806.0/ 
5.93 


>gi|1522951 8|ref |NP_1 89021 .1 1 AMP-binding protein, putative 
Arabidopsis thaliana] 


1.63 
e+008 


20/112 
(17%) 


53553.2/ 
11.48 


>gi|15225620|ref|NP_1 81536.1 1 En/Spm-like transposon 
protein [Arabidopsis thaliana] 
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1.59 
e+008 


23/112 
(20%) 


98473.6 / 
5.64 


>gi|15239960|ref|NP 1 99187.1 1 disease resistance protein 
Arabidopsis thatiana] 


1.5 
e+008 


24/112 
(21%) 


73854.7 / 
6.60 


(AB030296) AtABAI 


1.47 
e+008 


21/112 
(18%) 


58491.7/ 
5.71 


>gi|1 524201 9|ref |NP_20051 3.1 |lycopene epsilon cyclase 
Arabidopsis thaliana] 


1.47 
e+008 


21/112 
(18%) 


58515.7/ 
5.79 


LYCOPENE EPSILON CYCLASE, CHLOROPLAST 
PRECURSOR 


1.2 
e+008 


20/112 
(17%) 


88237.6 / 
9.02 


>gi|15220762|ref|NP_1 76422.1 1 hypothetical protein 
Arabidopsis thaliana] 


1.18 
e+008 


20/1 1 2 
(17%) 


89309.9 / 
9.07 


(AC0031 1 3) F2401 .4 


1.15 
e+008 


22/112 
(19%) 


82749.0 / 
5.21 


>gi|15240560|ref|NP_1 99794.1 1 putative protein [Arabidopsis 
thaliana] 


1.08 
e+008 


15/112 
(13%) 


69816.5 / 
6.26 


>gi[15227604[ref|NP 1 80530.1 1 anthranilate synthase, alpha 
subunit [Arabidopsis thaliana] 


1.08 
e+008 


15/112 
(13%) 


69880.6 / 
6.34 


anthranilate synthase (EC 4.1.3.27) alpha-2 chain 


1.04 
e+008 


14/112 
(12%) 


18350.4/ 
4.18 


>gi[15221030|ref|NP 1 73259.1 1 calcium-binding protein, 
autative [Arabidopsis thaliana] 


1.01 
e+008 


17/112 
(15%) 


69901 .5 / 
4.97 


>gi|1 521 8356|ref|NP_1 7501 7.1 1 hypothetical protein 
Arabidopsis thaliana] 


9.4 
e+007 


27/1 1 2 
(24%) 


83768.8 / 
6.06 


>gi|1 522731 6|ref|NP 179280.1 1 putative salt-inducible protein 
[Arabidopsis thaliana] 


9.34 
e+007 


15/112 
(13%) 


94906.6 / 
6.37 


(AF3831 70) chromomethylase CMT3 


9.24 
e+007 


19/1 12 
(16%) 


86999.6 / 
9.41 


hypothetical protein F2401 .3 


9,00 
E+07 


14/112 
(12%) 


62891 .3 / 
9.76 


>gi|15242752|ref|NP 201 144.1 1 auxin-independent growth 
promoter-like protein [Arabidopsis thaliana] 


8.77 
e+007 


18/112 
(16%) 


87505.3 / 
6.84 


>gi|15234605|ref|NP 194731.1 1 awaiting functional 
assignment [Arabidopsis thaliana] 


8.29 
e+007 


14/112 
(12%) 


77819.7/ 
6.00 


deltal -pyrroline-5-carboxylate synthetase [imported] 


8.07 
e+007 


26/1 12 
(23%) 


64010.5 / 
6.57 


>gi|15228825|ref|NP 1 88906.1 1 hypothetical protein 
[Arabidopsis thaliana] 


8.05 
e+007 


22/1 12 
(19%) 


57561 .3 / 
5.97 


glutathione synthase (EC 6.3.2.3) 2 precursor, chloroplast 


7.93 
e+007 


21/112 
(18%) 


96451.3/ 
9.06 


(AP00041 7) beta-1 ,4-xylosidase 


7.41 
e+007 


17/112 
(15%) 


45861.3/ 
5.51 


>gi|1 521 9628|ref|NP_1 76807.1 1 unknown protein [Arabidopsis 
thaliana] 


7.1 
e+007 


14/112 
(12%) 


94905.6/ 
6.57 


>gi|15222449|ref|NP_1 771 35.1 1 putative chromomethylase 
[Arabidopsis thaliana] 


6.97 
e+007 


13/112 
(11%) 


41782.4/ 
5.12 


(AF418279) 2-oxoglutarate-dependent dioxygenase 
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6.96 
e+007 


13/112 
(11%) 


41892.5/ 
5.19 


(AF418283) 2-oxoglutarate-dependent dioxygenase 


6.93 
e+007 


16/112 
(14%) 


42066.7/ 
5.25 


(AF418277) 2-oxogtutarate-dependent dioxygenase 


6.91 
e+007 


13/112 
(11%) 


42193.8/ 
5.19 


(AF41 8278) 2-oxoglutarate-dependent dioxygenase 


6.65 
e+007 


21/112 
(18%) 


68357.3/ 
5.22 


>gi|15223533|ref|NP_1 76036.1 1 heat shock protein, putative 
Arabidopsis thaliana] 


6.62 
e+007 


16/112 
(14%) 


92104.5/ 
6.67 


>gi|15219046|ref|NP_175671 .1 1 hypothetical protein 
[Arabidopsis thaliana] 


6.55 
e+007 


22/112 
(19%) 


85251.5/ 
6.12 


>gi|15220477|ref|NP_176925.1 1 putative protein kinase 
[Arabidopsis thaliana] 


6.49 
e+007 


16/112 
(14%) 


44914.1 / 
5.26 


(AF41 8276) 2-oxoglutarate-dependent dioxygenase 


6.48 
e+007 


16/112 
(14%) 


44936.1 / 
5.40 


(AF41 8274) 2-oxoglutarate-dependent dioxygenase 


6.48 
e+007 


16/112 
(14%) 


44943.1 / 
5.26 


(AF417859) AOP3 


6.48 
e+007 


13/112 
(11%) 


44944.1 / 
5.20 


(AF41 8275) 2-oxoglutarate-dependent dioxygenase 


6.12 
e+007 


26/112 
(23%) 


97742.7/ 
8.10 


>gi|15236576|ret|NP_1 9261 2.1 1 putative protein [Arabidopsis 
thaliana] 


5.96 
e+007 


21/112 
(18%) 


59364.9/ 
9.45 


>gi|15236834|ref|NP_1 94398.1 1 putative protein [Arabidopsis 
thaliana] 


5.83 
e+007 


14/112 
(12%) 


49681.4/ 
8.54 


>gi|15241740|ref]NP_198757.1| unknown protein [Arabidopsis 
thaliana] 


5.79 
e+007 


22/112 
(19%) 


73789.6/ 
6.43 


(AF1 34577) zeaxanthin epoxidase 


5.22 
e+007 


26/112 
(23%) 


99549.4 / 
8.45 


(AC01 1708) putative DNA gyrase A subunit 


5.2 
e+007 


13/112 
(11%) 


74888.6/ 
6.92 


>gi|15238566|ref|NP_200801 .1 1 putative protein [Arabidopsis 
thaliana] 


5.04 
e+007 


21/112 
(18%) 


74663.7/ 
5.76 


>gi|15234413|ref|NP_194549.1 1 putative protein [Arabidopsis 
thaliana] 


5.01 
e+007 


18/112 
(16%) 


81414.5/ 
6.08 


>gi|15236567|ref|NP_192606.1 1 putative MuDR-like 
transposon protein [Arabidopsis thaliana] 


4.87 
e+007 


16/112 
(14%) 


81078.9/ 
5.65 


>gi|1 523401 7|ref|NP_1 9421 5.1 1 brefeldin A-sensitive Golgi 
protein - like [Arabidopsis thaliana] 


4.75 
e+007 


20/112 
(17%) 


92699.6/ 
8.31 


(AF1 38281) phosphotipase D-gamma-2 


4.66 
e+007 


16/112 
(14%) 


84654.0/ 
5.53 


(AF367300) AT4g24840/F6l7_50 


4.63 
e+007 


19/112 
(16%) 


65466.2/ 
8.37 


(AF369565) membrane protein Mlo4 


4.61 
e+007 


14/112 
(12%) 


79492.1 / 
8.17 


>gi[15238268|ref|NP_1 96088.1 1 putative protein [Arabidopsis 
thaliana] 
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4.53 
e+007 


22/112 
(19%) 


79498.0/ 
7.20 


helicase homolog T6H20.10 


4.47 
e+007 


16/112 
(14%) 


89843.5/ 
7.57 


>gi|1 5241 340|ref|NP_1 96925.1 1 receptor protein kinase-like 
Drotein [Arabidopsis thaliana] 


4.33 
e+007 


19/112 
(16%) 


78944.3/ 
9.20 


>gi|15230744|ref|NP_1 89651.1 1 hypothetical protein 
Arabidopsis thaliana] 


4,00 
E+07 


16/112 
(14%) 


62984.6/ 
8.87 


>gi|15232344|ref|NP_1 88709.1 1 unknown protein [Arabidopsis 
thaliana] 


3.94 
e+007 


14/112 
(12%) 


38665.0/ 
4.84 


>gi|15240470|ref|NP_200326.1| putative protein [Arabidopsis 
thaliana] 


3.84 
e+007 


17/112 
(15%) 


79023.3/ 
6.04 


(AC069251)F2D10.23 
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Claims 

1. A method for providing data useful for quantitatively analyzing metabolites, 
proteins and/or RNA in a biological source material, whereby said analysis 
involves suitable statistical evaluation and correlation analysis on the data 
obtained, 

said method being characterized by the step of extracting, identifying and 
quantifying at least two compound classes of the group consisting of metabolites, 
proteins and RNA from at least one sample from said biological source material, 
wherein the compounds of said at least two classes are each determined from one 
sample. 

2. The method of claim 1, wherein said analysis furthermore involves network 
analysis. 

3. The method of claim 1 or 2, wherein extracting, identifying and quantifying is 
carried out taking a multitude of samples. 

4. The method of any on of claims 1 to 3, wherein said extracting comprises the 
steps of: 

(a) extracting the metabolites from said sample with at least one solvent or 
mixture of solvents; 

(b) extracting the proteins from the remainder of the sample after step (a); 

(c) extracting the RNA from the remainder of the sample after step (a); and 

(d) optionally dissolving remaining cellular material contained in said sample. 

5. The method of claim 4, wherein said mixture of solvents comprises at least one 
highty polar solvent, at least one less polar solvent and at least one lipophilic 
solvent. 

6. The method of claim 5, wherein said mixture of solvents comprises water, 
methanol and chloroform. 
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7. The method of claim 6, wherein said mixture of solvents contains water, methanol 
and chloroform in the approximate proportion by volume of 1 : 2.5: 1 . 

8. The method of any one of claims 4 to 7, wherein step (a) is carried out at a 
temperature between -60 and +4°C. 

9. The method of any one of claims 1 to 8, further comprising removing detection- 
disturbing compounds from the metabolites, the polypeptides and/or the RNA prior 
to identifying and quantifying the metabolites, proteins and/or RNA. 

10. The method of claim 9, wherein the detection-disturbing compounds are 
carbohydrates or other compounds that disturb identification and quantification of 
RNA. 

11. A method for quantitatively analyzing metabolites, proteins and/or RNA in a 
biological source material comprising 

(a) providing data on metabolites, proteins and/or RNA in said biological 
source material according to the method of any one of claims 1 to 10; 

(b) performing suitable statistical evaluation and correlation analysis on the 
data obtained; and 

(c) optionally further performing a network analysis on the data obtained in 
step (b). 

12. Use of a mixture of solvents as defined in any one of claims 4 to 8 for extracting 
metabolites from a sample of a biological source material in order to perform 
metabolite profiling. 

13. The use of claim 12, wherein additionally proteins and/or RNA is extracted from 
said sample. 
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search has been carried out for those parts of the claims which appear to 
be clear, supported and disclosed, namely methods comprising the removal 
of carbohydrates, that disturb identification and quantification of RNA . 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



